public inbox for libc-locales@sourceware.org
* [Bug localedata/13061] iconv mapping of 0xA8 0xEC in CP1258 is non-canonical
  2011-08-06 19:30 [Bug localedata/13061] New: iconv mapping of 0xA8 0xEC in CP1258 is non-canonical bruno at clisp dot org
@ 2011-08-06 19:30 ` bruno at clisp dot org
  2011-12-23 14:05 ` drepper.fsp at gmail dot com
  2014-06-27 12:45 ` fweimer at redhat dot com
  2 siblings, 0 replies; 4+ messages in thread
From: bruno at clisp dot org @ 2011-08-06 19:30 UTC (permalink / raw)
  To: libc-locales

http://sourceware.org/bugzilla/show_bug.cgi?id=13061

--- Comment #1 from Bruno Haible <bruno at clisp dot org> 2011-08-06 16:35:06 UTC ---
Created attachment 5878
  --> http://sourceware.org/bugzilla/attachment.cgi?id=5878
proposed fix

-- 
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.


* [Bug localedata/13061] New: iconv mapping of 0xA8 0xEC in CP1258 is non-canonical
@ 2011-08-06 19:30 bruno at clisp dot org
  2011-08-06 19:30 ` [Bug localedata/13061] " bruno at clisp dot org
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: bruno at clisp dot org @ 2011-08-06 19:30 UTC (permalink / raw)
  To: libc-locales

http://sourceware.org/bugzilla/show_bug.cgi?id=13061

           Summary: iconv mapping of 0xA8 0xEC in CP1258 is non-canonical
           Product: glibc
           Version: 2.14
            Status: NEW
          Severity: normal
          Priority: P2
         Component: localedata
        AssignedTo: libc-locales@sources.redhat.com
        ReportedBy: bruno@clisp.org


Bug 12777 <http://sourceware.org/bugzilla/show_bug.cgi?id=12777>
was fixed to map U+0385 (like U+1FEE) to 0xA8 0xEC. Good.

But at the same time, in the reverse direction, 0xA8 0xEC ought to map to
U+0385, not to U+1FEE. Why?

1) http://www.unicode.org/charts/PDF/U1F00.pdf states
that the decomposition of U+1FEE is U+0385. That is, U+0385 is a "simpler"
Unicode character than U+1FEE, although both look very similar
(cf. http://www.unicode.org/charts/PDF/U1F00.pdf and
http://www.unicode.org/charts/PDF/U0370.pdf).

2) According to http://www.unicode.org/versions/Unicode6.0.0/ch07.pdf,
the block U+0370..U+03FF is more for modern Greek, whereas the block
U+1F00..U+1FFF is mostly for ancient Greek. But CP1258 is about modern Greek.
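The decomposition argument in point 1 can be checked against the Unicode
character database. The following is a minimal sketch, not part of the
attached patch; it assumes Python 3 with its bundled unicodedata module and
cp1258 codec, and the names OXIA, TONOS and codepoints are illustrative only.

```python
import unicodedata

OXIA = "\u1fee"   # GREEK DIALYTIKA AND OXIA (Greek Extended block)
TONOS = "\u0385"  # GREEK DIALYTIKA TONOS (Greek and Coptic block)


def codepoints(text):
    """Render a string as a space-separated list of U+XXXX code points."""
    return " ".join("U+%04X" % ord(ch) for ch in text)


# Canonical decomposition of U+1FEE as recorded in the character database.
singleton = unicodedata.decomposition(OXIA)

# Normalizing U+1FEE to NFC, and fully decomposing U+0385 to NFD.
nfc_of_oxia = unicodedata.normalize("NFC", OXIA)
nfd_of_tonos = unicodedata.normalize("NFD", TONOS)

# Python's cp1258 codec decodes each byte separately and does not compose,
# so the two bytes come out as a base character plus a combining mark.
decoded = b"\xa8\xec".decode("cp1258")
recomposed = unicodedata.normalize("NFC", decoded)

print("decomposition(U+1FEE):", singleton)
print("NFC(U+1FEE):          ", codepoints(nfc_of_oxia))
print("NFD(U+0385):          ", codepoints(nfd_of_tonos))
print("cp1258 A8 EC decoded: ", codepoints(decoded))
print("NFC of decoded bytes: ", codepoints(recomposed))
```

If the character database agrees with the charts cited above, NFC maps both
U+1FEE and the decoded byte pair to U+0385, which is the sense in which
U+0385 is the canonical target.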

To reproduce:

$ printf '\xA8\xEC' | iconv -f CP1258 -t UCS-4LE | od -t x4
0000000 00001fee
0000004

Should be:

$ printf '\xA8\xEC' | iconv -f CP1258 -t UCS-4LE | od -t x4
0000000 00000385
0000004
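For readers comparing the two od dumps, here is a small sketch of how a
UCS-4LE byte stream corresponds to the 32-bit words that od -t x4 prints.
It assumes a little-endian host (od uses host byte order) and uses Python's
utf-32-le codec as a stand-in for UCS-4LE; the helper name words_x4 is
hypothetical.

```python
import struct


def words_x4(data):
    """Format little-endian 32-bit words the way `od -t x4` shows them."""
    return ["%08x" % word for (word,) in struct.iter_unpack("<I", data)]


# The mapping reported above versus the mapping the report asks for.
reported = words_x4("\u1fee".encode("utf-32-le"))
requested = words_x4("\u0385".encode("utf-32-le"))

print("reported: ", reported)
print("requested:", requested)
```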

Attached is a probable fix (untested).


* [Bug localedata/13061] iconv mapping of 0xA8 0xEC in CP1258 is non-canonical
  2011-08-06 19:30 [Bug localedata/13061] New: iconv mapping of 0xA8 0xEC in CP1258 is non-canonical bruno at clisp dot org
  2011-08-06 19:30 ` [Bug localedata/13061] " bruno at clisp dot org
@ 2011-12-23 14:05 ` drepper.fsp at gmail dot com
  2014-06-27 12:45 ` fweimer at redhat dot com
  2 siblings, 0 replies; 4+ messages in thread
From: drepper.fsp at gmail dot com @ 2011-12-23 14:05 UTC (permalink / raw)
  To: libc-locales

http://sourceware.org/bugzilla/show_bug.cgi?id=13061

Ulrich Drepper <drepper.fsp at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
                 CC|                            |drepper.fsp at gmail dot
                   |                            |com
         Resolution|                            |FIXED

--- Comment #2 from Ulrich Drepper <drepper.fsp at gmail dot com> 2011-12-23 13:44:13 UTC ---
I applied the patch.


* [Bug localedata/13061] iconv mapping of 0xA8 0xEC in CP1258 is non-canonical
  2011-08-06 19:30 [Bug localedata/13061] New: iconv mapping of 0xA8 0xEC in CP1258 is non-canonical bruno at clisp dot org
  2011-08-06 19:30 ` [Bug localedata/13061] " bruno at clisp dot org
  2011-12-23 14:05 ` drepper.fsp at gmail dot com
@ 2014-06-27 12:45 ` fweimer at redhat dot com
  2 siblings, 0 replies; 4+ messages in thread
From: fweimer at redhat dot com @ 2014-06-27 12:45 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=13061

Florian Weimer <fweimer at redhat dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
              Flags|                            |security-

