public inbox for libc-locales@sourceware.org
* [Bug localedata/29506] New: UTF-8 HANGUL SYLLABLE bugs
@ 2022-08-19 13:54 jakub at redhat dot com
  2024-01-14 10:36 ` [Bug localedata/29506] " maiku.fabian at gmail dot com
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: jakub at redhat dot com @ 2022-08-19 13:54 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=29506

            Bug ID: 29506
           Summary: UTF-8 HANGUL SYLLABLE bugs
           Product: glibc
           Version: 2.38
            Status: NEW
          Severity: normal
          Priority: P2
         Component: localedata
          Assignee: unassigned at sourceware dot org
          Reporter: jakub at redhat dot com
                CC: libc-locales at sourceware dot org
  Target Milestone: ---

localedata/unicode-gen/utf8_gen.py
lists the 6th element of JAMO_FINAL_SHORT_NAME as NI, but according to Unicode
it should be NJ. (Every version I've checked says NJ, and Unicode states that
character names are immutable.)
See https://www.unicode.org/Public/4.1.0/ucd/Jamo.txt
11AC; NJ  # HANGUL JONGSEONG NIEUN-CIEUC
or
https://www.unicode.org/Public/14.0.0/ucd/Jamo.txt
11AC; NJ  # HANGUL JONGSEONG NIEUN-CIEUC

As a result, the generated UTF-8 charmap contains entries like:
<UAD8D>     /xea/xb6/x8d HANGUL SYLLABLE GWEONI
which my Unicode name-to-codepoint function cannot recognize, although
it does map "HANGUL SYLLABLE GWEONJ" to U+AD8D.
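
For illustration, here is a minimal sketch of how the name of U+AD8D is derived
arithmetically from the Jamo short names, and why a lookup keyed on the correct
names rejects the GWEONI spelling. The table and function names below are
hypothetical and are not taken from utf8_gen.py; the short names are the ones
listed in Jamo.txt.

```python
# Jamo short names as listed in Unicode's Jamo.txt.
# Table and function names here are illustrative, not those used in utf8_gen.py.
JAMO_INITIAL = ('G', 'GG', 'N', 'D', 'DD', 'R', 'M', 'B', 'BB', 'S',
                'SS', '', 'J', 'JJ', 'C', 'K', 'T', 'P', 'H')
JAMO_MEDIAL = ('A', 'AE', 'YA', 'YAE', 'EO', 'E', 'YEO', 'YE', 'O', 'WA',
               'WAE', 'OE', 'YO', 'U', 'WEO', 'WE', 'WI', 'YU', 'EU', 'YI', 'I')
# Index 5 (the 6th element) is NJ, for U+11AC HANGUL JONGSEONG NIEUN-CIEUC.
JAMO_FINAL = ('', 'G', 'GG', 'GS', 'N', 'NJ', 'NH', 'D', 'L', 'LG',
              'LM', 'LB', 'LS', 'LT', 'LP', 'LH', 'M', 'B', 'BS', 'S',
              'SS', 'NG', 'J', 'C', 'K', 'T', 'P', 'H')

S_BASE = 0xAC00                     # first precomposed Hangul syllable
V_COUNT = len(JAMO_MEDIAL)          # 21
T_COUNT = len(JAMO_FINAL)           # 28
S_COUNT = len(JAMO_INITIAL) * V_COUNT * T_COUNT  # 11172 syllables


def hangul_syllable_name(codepoint):
    """Return the Unicode name of a precomposed Hangul syllable."""
    s_index = codepoint - S_BASE
    if not 0 <= s_index < S_COUNT:
        raise ValueError('not a Hangul syllable: U+%04X' % codepoint)
    # Split the syllable index into initial, medial and final indices.
    l_index, rest = divmod(s_index, V_COUNT * T_COUNT)
    v_index, t_index = divmod(rest, T_COUNT)
    return ('HANGUL SYLLABLE ' + JAMO_INITIAL[l_index]
            + JAMO_MEDIAL[v_index] + JAMO_FINAL[t_index])


# Reverse mapping (name -> code point), built by enumerating every syllable.
NAME_TO_CODEPOINT = {hangul_syllable_name(cp): cp
                     for cp in range(S_BASE, S_BASE + S_COUNT)}

print(hangul_syllable_name(0xAD8D))                        # HANGUL SYLLABLE GWEONJ
print(hex(NAME_TO_CODEPOINT['HANGUL SYLLABLE GWEONJ']))    # 0xad8d
print('HANGUL SYLLABLE GWEONI' in NAME_TO_CODEPOINT)       # False
```

U+AD8D has syllable index 0x18D = 397, which splits into initial 0 (G),
medial 14 (WEO) and final 5, so the name depends directly on that 6th entry
of the final table.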



Thread overview: 4+ messages
2022-08-19 13:54 [Bug localedata/29506] New: UTF-8 HANGUL SYLLABLE bugs jakub at redhat dot com
2024-01-14 10:36 ` [Bug localedata/29506] " maiku.fabian at gmail dot com
2024-01-14 17:00 ` cvs-commit at gcc dot gnu.org
2024-01-14 17:02 ` maiku.fabian at gmail dot com