public inbox for glibc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug localedata/16061] New: Review / update transliteration data
@ 2013-10-18  8:04 myllynen at redhat dot com
  2014-02-18  9:24 ` [Bug localedata/16061] " pravin.d.s at gmail dot com
                   ` (6 more replies)
  0 siblings, 7 replies; 8+ messages in thread
From: myllynen at redhat dot com @ 2013-10-18  8:04 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=16061

            Bug ID: 16061
           Summary: Review / update transliteration data
           Product: glibc
           Version: 2.18
            Status: NEW
          Severity: normal
          Priority: P2
         Component: localedata
          Assignee: unassigned at sourceware dot org
          Reporter: myllynen at redhat dot com
                CC: libc-locales at sourceware dot org

The localedata/locales/translit_* files are probably, based on comments in
them, at least partially generated from some version of UnicodeData.txt (based
on 93a568 it looks like the last major update has been for Unicode 3.2 and
17b16e suggests them originally coming from an external contributor). However,
there are some characters missing even from the Latin-1 Supplement block and in
general it doesn't seem possible to update the files just by using
UnicodeData.txt. Some of the rules live in locale/C-translit.h /
locale/C-translit.h.in which also contain local changes (like 61d5a6 / 2a81ea).

It requires likely a lot of work to understand how the files have been
generated in the first place, how to identify relevant local changes, and how
to automate the process to update them in the future.

Some individual examples of currently missing characters are U+00D8 (Ø) and
U+0110 (Đ) whereas other characters like U+00C6 (Æ) and U+0141 (Ł) from their
blocks (Latin-1 Supplement and Latin Extended-A, respectively) are present.
Some characters (like U+2033, ″) have decomposition defined as is in Unicode
but some characters (like U+00D6, Ö) have decomposition defined in Unicode but
not in glibc.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
>From glibc-bugs-return-19859-listarch-glibc-bugs=sources.redhat.com@sourceware.org Fri Oct 18 11:12:01 2013
Return-Path: <glibc-bugs-return-19859-listarch-glibc-bugs=sources.redhat.com@sourceware.org>
Delivered-To: listarch-glibc-bugs@sources.redhat.com
Received: (qmail 11131 invoked by alias); 18 Oct 2013 11:12:01 -0000
Mailing-List: contact glibc-bugs-help@sourceware.org; run by ezmlm
Precedence: bulk
List-Id: <glibc-bugs.sourceware.org>
List-Subscribe: <mailto:glibc-bugs-subscribe@sourceware.org>
List-Post: <mailto:glibc-bugs@sourceware.org>
List-Help: <mailto:glibc-bugs-help@sourceware.org>, <http://sourceware.org/lists.html#faqs>
Sender: glibc-bugs-owner@sourceware.org
Delivered-To: mailing list glibc-bugs@sourceware.org
Received: (qmail 11083 invoked by uid 48); 18 Oct 2013 11:11:58 -0000
From: "bugdal at aerifal dot cx" <sourceware-bugzilla@sourceware.org>
To: glibc-bugs@sourceware.org
Subject: [Bug stdio/5994] fflush after ungetc on seekable input stream
Date: Fri, 18 Oct 2013 11:12:00 -0000
X-Bugzilla-Reason: CC
X-Bugzilla-Type: changed
X-Bugzilla-Watch-Reason: None
X-Bugzilla-Product: glibc
X-Bugzilla-Component: stdio
X-Bugzilla-Version: 2.3.4
X-Bugzilla-Keywords:
X-Bugzilla-Severity: normal
X-Bugzilla-Who: bugdal at aerifal dot cx
X-Bugzilla-Status: NEW
X-Bugzilla-Priority: P2
X-Bugzilla-Assigned-To: drepper.fsp at gmail dot com
X-Bugzilla-Target-Milestone: ---
X-Bugzilla-Flags:
X-Bugzilla-Changed-Fields:
Message-ID: <bug-5994-131-xFI3BXNvYR@http.sourceware.org/bugzilla/>
In-Reply-To: <bug-5994-131@http.sourceware.org/bugzilla/>
References: <bug-5994-131@http.sourceware.org/bugzilla/>
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: 7bit
X-Bugzilla-URL: http://sourceware.org/bugzilla/
Auto-Submitted: auto-generated
MIME-Version: 1.0
X-SW-Source: 2013-10/txt/msg00218.txt.bz2
Content-length: 464

https://sourceware.org/bugzilla/show_bug.cgi?idY94

--- Comment #7 from Rich Felker <bugdal at aerifal dot cx> ---
Thanks Eric. I tested again and got the same results. I was just uncertain,
with this bug having been around so long, whether anything had changed since it
was first reported. Now I guess we need someone familiar with the code to look
at what's involved in fixing it.

--
You are receiving this mail because:
You are on the CC list for the bug.


^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2015-05-04 10:42 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-10-18  8:04 [Bug localedata/16061] New: Review / update transliteration data myllynen at redhat dot com
2014-02-18  9:24 ` [Bug localedata/16061] " pravin.d.s at gmail dot com
2014-06-13 12:38 ` fweimer at redhat dot com
2014-10-10 15:26 ` maiku.fabian at gmail dot com
2015-04-28 17:30 ` maiku.fabian at gmail dot com
2015-04-29  7:12 ` maiku.fabian at gmail dot com
2015-05-04  7:53 ` myllynen at redhat dot com
2015-05-04 10:42 ` maiku.fabian at gmail dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).