public inbox for libc-locales@sourceware.org
 help / color / mirror / Atom feed
* [Bug localedata/13063] New: Can not 'sort -u' all Chinese characters in CJK UNIFIED IDEOGRAPH EXTENSION A/B/C/D
@ 2011-08-06 19:28 an.euroford at gmail dot com
  2011-08-06 19:28 ` [Bug localedata/13063] " an.euroford at gmail dot com
                   ` (9 more replies)
  0 siblings, 10 replies; 11+ messages in thread
From: an.euroford at gmail dot com @ 2011-08-06 19:28 UTC (permalink / raw)
  To: libc-locales

http://sourceware.org/bugzilla/show_bug.cgi?id=13063

           Summary: Can not 'sort -u' all Chinese characters in CJK
                    UNIFIED IDEOGRAPH EXTENSION A/B/C/D
           Product: glibc
           Version: unspecified
            Status: NEW
          Severity: critical
          Priority: P2
         Component: localedata
        AssignedTo: libc-locales@sources.redhat.com
        ReportedBy: an.euroford@gmail.com


Hi,

Refer to glibc/localedata/locales/zh_CN and iso14651_t1_pinyin or
iso14651_t1, glibc just support unicode3.0.

The new version of unicode is 6.0, it extend CJK UNIFIED IDEOGRAPH with
extension A/B/C/D, and extension A is included in GB18030:2005( China
locale charset standard).

So at least, glibc should sort all Chinese characters in CJK UNIFIED IDEOGRAPH
and EXTENSIONA(U+3400-U+4DBF).

The real effect is sort -u.
If you execute sort -u examples_CJK_extensionA.txt (see attachment), you
will got only one Chinese character "㑗".


Regards,
An Yang

-- 
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

^ permalink raw reply	[flat|nested] 11+ messages in thread

end of thread, other threads:[~2017-07-20  8:02 UTC | newest]

Thread overview: 11+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2011-08-06 19:28 [Bug localedata/13063] New: Can not 'sort -u' all Chinese characters in CJK UNIFIED IDEOGRAPH EXTENSION A/B/C/D an.euroford at gmail dot com
2011-08-06 19:28 ` [Bug localedata/13063] " an.euroford at gmail dot com
2011-08-07 20:46 ` [Bug localedata/13063] 'sort -u' will erase some Chinese characters an.euroford at gmail dot com
2011-08-07 20:47 ` an.euroford at gmail dot com
2011-08-08 16:56 ` an.euroford at gmail dot com
2014-05-07  8:20 ` bluebat at member dot fsf.org
2014-06-13 15:12 ` fweimer at redhat dot com
2014-11-19  4:14 ` bluebat at member dot fsf.org
2017-01-22 23:58 ` arthur200126 at gmail dot com
2017-07-19 16:16 ` maiku.fabian at gmail dot com
2017-07-20  8:02 ` maiku.fabian at gmail dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).