public inbox for glibc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug locale/30243] New: GB18030-2022 is not supported!
@ 2023-03-18 17:05 starcold14 at gmail dot com
  2023-03-18 17:12 ` [Bug locale/30243] " starcold14 at gmail dot com
                   ` (23 more replies)
  0 siblings, 24 replies; 25+ messages in thread
From: starcold14 at gmail dot com @ 2023-03-18 17:05 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

            Bug ID: 30243
           Summary: GB18030-2022 is not supported!
           Product: glibc
           Version: unspecified
            Status: UNCONFIRMED
          Severity: normal
          Priority: P2
         Component: locale
          Assignee: unassigned at sourceware dot org
          Reporter: starcold14 at gmail dot com
  Target Milestone: ---

GB18030-2022 is coming and glic use GB18030-2005 standard which should be
updated!
The changes of GB18030-2022 can be found at Doctor Ken Lunde's
article(https://ken-lunde.medium.com/the-gb-18030-2022-standard-3d0ebaeb4132).

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
@ 2023-03-18 17:12 ` starcold14 at gmail dot com
  2023-04-19  9:44 ` starcold14 at gmail dot com
                   ` (22 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: starcold14 at gmail dot com @ 2023-03-18 17:12 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

--- Comment #1 from starcold14 <starcold14 at gmail dot com> ---
https://bugs.openjdk.org/browse/JDK-8301119?attachmentViewMode=gallery
JDK has updated the standard. Following above link gets the completed charmap.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
  2023-03-18 17:12 ` [Bug locale/30243] " starcold14 at gmail dot com
@ 2023-04-19  9:44 ` starcold14 at gmail dot com
  2023-05-20 22:52 ` bruno at clisp dot org
                   ` (21 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: starcold14 at gmail dot com @ 2023-04-19  9:44 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

starcold14 <starcold14 at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Severity|normal                      |critical
            Version|unspecified                 |2.39
           Priority|P2                          |P1

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
  2023-03-18 17:12 ` [Bug locale/30243] " starcold14 at gmail dot com
  2023-04-19  9:44 ` starcold14 at gmail dot com
@ 2023-05-20 22:52 ` bruno at clisp dot org
  2023-05-21 11:22 ` schwab@linux-m68k.org
                   ` (20 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bruno at clisp dot org @ 2023-05-20 22:52 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

Bruno Haible <bruno at clisp dot org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |bruno at clisp dot org

--- Comment #2 from Bruno Haible <bruno at clisp dot org> ---
Created attachment 14890
  --> https://sourceware.org/bugzilla/attachment.cgi?id=14890&action=edit
mapping tables

The official GB18030-2022 mapping table can be downloaded from
http://www.nits.org.cn/index/article/4034 (two data files).

The difference between GB18030-2005 and GB18030-2022, regarding the mapping
tables, is that GB18030-2022 gets rid of a PUA (private use area) mapping of
some characters that were not part of Unicode in 2005 but are in Unicode
nowadays. In other words, these PUA mappings are considered obsolete.

Find attached a tar file with
1) The current mapping tables from glibc (extracted from glibc 2.35, but it
hasn't changed since then),
2) The mapping tables from GNU libiconv.
The <encoding>.TXT files describe the multibyte to Unicode conversion
direction; the <encoding>.INVERSE.TXT files describe the Unicode to multibyte
conversion direction.

What needs to be done in glibc?

* For the multibyte to Unicode conversion direction: Look at "diff -u
glibc-2.35-iconv/GB18030.TXT libiconv/GB18030-2022.TXT"
  - Mappings for 0x82359037..0x82359134 and 0x84318236..0x84318335 need to be
added.
  - The mappings of 0xFE51, 0xFE52, 0xFE53, 0xFE6C, 0xFE76, 0xFE91 need to be
changed.

* For the Unicode to multibyte conversion direction: Look at "diff -u
glibc-2.35-iconv/GB18030.INVERSE.TXT libiconv/GB18030-2022.INVERSE.TXT"
  - Mappings for U+E81E, U+E826, U+E82B, U+E82C, U+E832, U+E843, U+E854,
U+E864, U+E78D..U+E796 need to be added.
  - The mappings of U+20087, U+20089, U+200CC, U+215D7, U+2298F, U+241FE need
to be changed.
  - Mappings for U+E816, U+E817, U+E818, U+E831, U+E83B, U+E855 need to be
added.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (2 preceding siblings ...)
  2023-05-20 22:52 ` bruno at clisp dot org
@ 2023-05-21 11:22 ` schwab@linux-m68k.org
  2023-06-16  1:28 ` liqingqing3 at huawei dot com
                   ` (19 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: schwab@linux-m68k.org @ 2023-05-21 11:22 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

--- Comment #3 from Andreas Schwab <schwab@linux-m68k.org> ---
Feel free to propose a patch.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (3 preceding siblings ...)
  2023-05-21 11:22 ` schwab@linux-m68k.org
@ 2023-06-16  1:28 ` liqingqing3 at huawei dot com
  2023-06-16  1:54 ` bruno at clisp dot org
                   ` (18 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: liqingqing3 at huawei dot com @ 2023-06-16  1:28 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

liqingqing <liqingqing3 at huawei dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |liqingqing3 at huawei dot com

--- Comment #4 from liqingqing <liqingqing3 at huawei dot com> ---
we have finished a patch for this, please review, thanks!
https://patchwork.sourceware.org/project/glibc/patch/20230615113800.2174-1-lijianglin2@huawei.com/

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (4 preceding siblings ...)
  2023-06-16  1:28 ` liqingqing3 at huawei dot com
@ 2023-06-16  1:54 ` bruno at clisp dot org
  2023-06-16  1:54 ` bruno at clisp dot org
                   ` (17 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bruno at clisp dot org @ 2023-06-16  1:54 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

--- Comment #5 from Bruno Haible <bruno at clisp dot org> ---
Created attachment 14930
  --> https://sourceware.org/bugzilla/attachment.cgi?id=14930&action=edit
Part 1 of a draft proposed fix

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (5 preceding siblings ...)
  2023-06-16  1:54 ` bruno at clisp dot org
@ 2023-06-16  1:54 ` bruno at clisp dot org
  2023-06-16  2:01 ` bruno at clisp dot org
                   ` (16 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bruno at clisp dot org @ 2023-06-16  1:54 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

--- Comment #6 from Bruno Haible <bruno at clisp dot org> ---
Created attachment 14931
  --> https://sourceware.org/bugzilla/attachment.cgi?id=14931&action=edit
Part 2 of a draft proposed fix

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (6 preceding siblings ...)
  2023-06-16  1:54 ` bruno at clisp dot org
@ 2023-06-16  2:01 ` bruno at clisp dot org
  2023-06-25  6:21 ` lijianglin2 at huawei dot com
                   ` (15 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bruno at clisp dot org @ 2023-06-16  2:01 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

--- Comment #7 from Bruno Haible <bruno at clisp dot org> ---
I think
1) It would be useful to change the unit tests to test the entire GB18030
charmap, not only the BMP part. Find attached a draft patch to that effect. I'm
not happy with that; I intend to simplify it more.
2) My patch (part 2) also removes a few lines of "/* Handle a few special
cases.  */" in iconvdata/gb18030.c. Yours doesn't.
3) The comment "The newest GB 18030-2005 standard still uses some private use
area code points. ..." in localedata/charmaps/GB18030 should be removed, since
it does not reflect reality any more. GB 18030-2005 is no longer the newest
one. The newest one, from 2022, dropped the particular use of private use area
code points.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (7 preceding siblings ...)
  2023-06-16  2:01 ` bruno at clisp dot org
@ 2023-06-25  6:21 ` lijianglin2 at huawei dot com
  2023-06-27  6:15 ` lijianglin2 at huawei dot com
                   ` (14 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: lijianglin2 at huawei dot com @ 2023-06-25  6:21 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

lijianglin <lijianglin2 at huawei dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |lijianglin2 at huawei dot com

--- Comment #8 from lijianglin <lijianglin2 at huawei dot com> ---
(In reply to Bruno Haible from comment #7)
> I think
> 1) It would be useful to change the unit tests to test the entire GB18030
> charmap, not only the BMP part. Find attached a draft patch to that effect.
> I'm not happy with that; I intend to simplify it more.
> 2) My patch (part 2) also removes a few lines of "/* Handle a few special
> cases.  */" in iconvdata/gb18030.c. Yours doesn't.
> 3) The comment "The newest GB 18030-2005 standard still uses some private
> use area code points. ..." in localedata/charmaps/GB18030 should be removed,
> since it does not reflect reality any more. GB 18030-2005 is no longer the
> newest one. The newest one, from 2022, dropped the particular use of private
> use area code points.

Thank you for your suggestion. I will include these in my patch

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (8 preceding siblings ...)
  2023-06-25  6:21 ` lijianglin2 at huawei dot com
@ 2023-06-27  6:15 ` lijianglin2 at huawei dot com
  2023-06-28  8:45 ` lijianglin2 at huawei dot com
                   ` (13 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: lijianglin2 at huawei dot com @ 2023-06-27  6:15 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

--- Comment #9 from lijianglin <lijianglin2 at huawei dot com> ---
we have updated the patch, please review, thanks!
https://patchwork.sourceware.org/project/glibc/patch/20230627034706.3053-1-lijianglin2@huawei.com/

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (9 preceding siblings ...)
  2023-06-27  6:15 ` lijianglin2 at huawei dot com
@ 2023-06-28  8:45 ` lijianglin2 at huawei dot com
  2023-06-28 14:33 ` liqingqing3 at huawei dot com
                   ` (12 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: lijianglin2 at huawei dot com @ 2023-06-28  8:45 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

--- Comment #10 from lijianglin <lijianglin2 at huawei dot com> ---
(In reply to lijianglin from comment #9)
> we have updated the patch, please review, thanks!
> https://patchwork.sourceware.org/project/glibc/patch/20230627034706.3053-1-
> lijianglin2@huawei.com/

we have adjusted localedata/charmaps/GB18030,the latest patch(v3) as follows
https://patchwork.sourceware.org/project/glibc/patch/20230627121549.3431-1-lijianglin2@huawei.com/

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (10 preceding siblings ...)
  2023-06-28  8:45 ` lijianglin2 at huawei dot com
@ 2023-06-28 14:33 ` liqingqing3 at huawei dot com
  2023-07-03  8:09 ` lijianglin2 at huawei dot com
                   ` (11 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: liqingqing3 at huawei dot com @ 2023-06-28 14:33 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

--- Comment #11 from liqingqing <liqingqing3 at huawei dot com> ---
(In reply to Bruno Haible from comment #7)
> I think
> 1) It would be useful to change the unit tests to test the entire GB18030
> charmap, not only the BMP part. Find attached a draft patch to that effect.
> I'm not happy with that; I intend to simplify it more.
> 2) My patch (part 2) also removes a few lines of "/* Handle a few special
> cases.  */" in iconvdata/gb18030.c. Yours doesn't.
> 3) The comment "The newest GB 18030-2005 standard still uses some private
> use area code points. ..." in localedata/charmaps/GB18030 should be removed,
> since it does not reflect reality any more. GB 18030-2005 is no longer the
> newest one. The newest one, from 2022, dropped the particular use of private
> use area code points.

ok, thanks!

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (11 preceding siblings ...)
  2023-06-28 14:33 ` liqingqing3 at huawei dot com
@ 2023-07-03  8:09 ` lijianglin2 at huawei dot com
  2023-07-26 19:14 ` jamborm at gcc dot gnu.org
                   ` (10 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: lijianglin2 at huawei dot com @ 2023-07-03  8:09 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

--- Comment #12 from lijianglin <lijianglin2 at huawei dot com> ---
(In reply to lijianglin from comment #10)
> (In reply to lijianglin from comment #9)
> > we have updated the patch, please review, thanks!
> > https://patchwork.sourceware.org/project/glibc/patch/20230627034706.3053-1-
> > lijianglin2@huawei.com/
> 
> we have adjusted localedata/charmaps/GB18030,the latest patch(v3) as follows
> https://patchwork.sourceware.org/project/glibc/patch/20230627121549.3431-1-
> lijianglin2@huawei.com/

Does anyone have focus on this patch?

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (12 preceding siblings ...)
  2023-07-03  8:09 ` lijianglin2 at huawei dot com
@ 2023-07-26 19:14 ` jamborm at gcc dot gnu.org
  2023-08-16 13:26 ` matz at suse dot de
                   ` (9 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: jamborm at gcc dot gnu.org @ 2023-07-26 19:14 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

Martin Jambor <jamborm at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |jamborm at gcc dot gnu.org

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (13 preceding siblings ...)
  2023-07-26 19:14 ` jamborm at gcc dot gnu.org
@ 2023-08-16 13:26 ` matz at suse dot de
  2023-08-16 19:20 ` carlos at redhat dot com
                   ` (8 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: matz at suse dot de @ 2023-08-16 13:26 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

Michael Matz <matz at suse dot de> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |carlos at redhat dot com,
                   |                            |matz at suse dot de

--- Comment #13 from Michael Matz <matz at suse dot de> ---
Could this be given some attention please?  AFAICS the last version of the
patch didn't receive further suggestions, so maybe it's good to go? 
(Background:
people here are starting to worry about gb18030-2022 conformance, and ideally
we
want to include/backport only something that upstream also has)

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] GB18030-2022 is not supported!
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (14 preceding siblings ...)
  2023-08-16 13:26 ` matz at suse dot de
@ 2023-08-16 19:20 ` carlos at redhat dot com
  2023-08-16 19:20 ` [Bug locale/30243] Add full support for GB18030-2022 carlos at redhat dot com
                   ` (7 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: carlos at redhat dot com @ 2023-08-16 19:20 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

--- Comment #14 from Carlos O'Donell <carlos at redhat dot com> ---
(In reply to Michael Matz from comment #13)
> Could this be given some attention please?  AFAICS the last version of the
> patch didn't receive further suggestions, so maybe it's good to go? 
> (Background:
> people here are starting to worry about gb18030-2022 conformance, and
> ideally we
> want to include/backport only something that upstream also has)

I agree completely that this needs review.

I reviewed v1, Andreas reviewed v2. The patch is currently at v3, and that
version needs review:
https://patchwork.sourceware.org/project/glibc/patch/20230627121549.3431-1-lijianglin2@huawei.com/

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] Add full support for GB18030-2022
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (15 preceding siblings ...)
  2023-08-16 19:20 ` carlos at redhat dot com
@ 2023-08-16 19:20 ` carlos at redhat dot com
  2023-08-16 20:22 ` maiku.fabian at gmail dot com
                   ` (6 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: carlos at redhat dot com @ 2023-08-16 19:20 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

Carlos O'Donell <carlos at redhat dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
            Summary|GB18030-2022 is not         |Add full support for
                   |supported!                  |GB18030-2022

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] Add full support for GB18030-2022
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (16 preceding siblings ...)
  2023-08-16 19:20 ` [Bug locale/30243] Add full support for GB18030-2022 carlos at redhat dot com
@ 2023-08-16 20:22 ` maiku.fabian at gmail dot com
  2023-08-16 20:28 ` carlos at redhat dot com
                   ` (5 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: maiku.fabian at gmail dot com @ 2023-08-16 20:22 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

Mike FABIAN <maiku.fabian at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |maiku.fabian at gmail dot com

--- Comment #15 from Mike FABIAN <maiku.fabian at gmail dot com> ---
I tested v3 a while ago (2023-07-26). I think it is good. 


I could compile and install it. I wrote a small python test program to test
iconv with and without the patch.

https://mfabian.fedorapeople.org/misc/iconv-test.py

 The patch made all the codepoints mentioned in the tables in

https://www.unicode.org/L2/L2022/22274-disruptive-changes.pdf

work correctly for the 2022 version of the standard.

Without the patch, only the two byte GB18030 code points in the Annex A table
work, the 4 byte GB18030 codepoints work only with the patch. 

The patch is also needed to make the conversions in the Annex B and Annex C
tables work.

By the way, the Annex C table in the above PDF contains a typo in the first
column, last row: it should be U+8612, not U+8162.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] Add full support for GB18030-2022
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (17 preceding siblings ...)
  2023-08-16 20:22 ` maiku.fabian at gmail dot com
@ 2023-08-16 20:28 ` carlos at redhat dot com
  2023-08-17  1:55 ` liqingqing3 at huawei dot com
                   ` (4 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: carlos at redhat dot com @ 2023-08-16 20:28 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

--- Comment #16 from Carlos O'Donell <carlos at redhat dot com> ---
(In reply to Mike FABIAN from comment #15)
> I tested v3 a while ago (2023-07-26). I think it is good. 
> By the way, the Annex C table in the above PDF contains a typo in the first
> column, last row: it should be U+8612, not U+8162.

Does v3 correct this problem?

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] Add full support for GB18030-2022
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (18 preceding siblings ...)
  2023-08-16 20:28 ` carlos at redhat dot com
@ 2023-08-17  1:55 ` liqingqing3 at huawei dot com
  2023-08-17 10:25 ` maiku.fabian at gmail dot com
                   ` (3 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: liqingqing3 at huawei dot com @ 2023-08-17  1:55 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

--- Comment #17 from liqingqing <liqingqing3 at huawei dot com> ---
(In reply to Carlos O'Donell from comment #16)
> (In reply to Mike FABIAN from comment #15)
> > I tested v3 a while ago (2023-07-26). I think it is good. 
> > By the way, the Annex C table in the above PDF contains a typo in the first
> > column, last row: it should be U+8612, not U+8162.
> 
> Does v3 correct this problem?

(In reply to Mike FABIAN from comment #15)
> I tested v3 a while ago (2023-07-26). I think it is good. 
> 
> 
> I could compile and install it. I wrote a small python test program to test
> iconv with and without the patch.
> 
> https://mfabian.fedorapeople.org/misc/iconv-test.py
> 
>  The patch made all the codepoints mentioned in the tables in
> 
> https://www.unicode.org/L2/L2022/22274-disruptive-changes.pdf
> 
> work correctly for the 2022 version of the standard.
> 
> Without the patch, only the two byte GB18030 code points in the Annex A
> table work, the 4 byte GB18030 codepoints work only with the patch. 
> 
> The patch is also needed to make the conversions in the Annex B and Annex C
> tables work.
> 
> By the way, the Annex C table in the above PDF contains a typo in the first
> column, last row: it should be U+8612, not U+8162.

yes, you are right, I checked the GB18030-2022 standard,
http://c.gb688.cn/bzgk/gb/showGb?type=online&hcno=A1931A578FE14957104988029B0833D3

page67 list the below code point: 
unicode code point U+8612 <--> GB18030 code point: CC55

so, the above PDF is  wrong, it should be U+8612

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] Add full support for GB18030-2022
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (19 preceding siblings ...)
  2023-08-17  1:55 ` liqingqing3 at huawei dot com
@ 2023-08-17 10:25 ` maiku.fabian at gmail dot com
  2023-08-29 17:04 ` maiku.fabian at gmail dot com
                   ` (2 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: maiku.fabian at gmail dot com @ 2023-08-17 10:25 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

--- Comment #18 from Mike FABIAN <maiku.fabian at gmail dot com> ---
(In reply to Carlos O'Donell from comment #16)
> (In reply to Mike FABIAN from comment #15)
> > I tested v3 a while ago (2023-07-26). I think it is good. 
> > By the way, the Annex C table in the above PDF contains a typo in the first
> > column, last row: it should be U+8612, not U+8162.
> 
> Does v3 correct this problem?

Yes, v3 is correct.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] Add full support for GB18030-2022
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (20 preceding siblings ...)
  2023-08-17 10:25 ` maiku.fabian at gmail dot com
@ 2023-08-29 17:04 ` maiku.fabian at gmail dot com
  2023-08-29 17:04 ` maiku.fabian at gmail dot com
  2023-08-29 17:31 ` carlos at redhat dot com
  23 siblings, 0 replies; 25+ messages in thread
From: maiku.fabian at gmail dot com @ 2023-08-29 17:04 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

--- Comment #19 from Mike FABIAN <maiku.fabian at gmail dot com> ---
Patch v3 pushed to git master.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] Add full support for GB18030-2022
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (21 preceding siblings ...)
  2023-08-29 17:04 ` maiku.fabian at gmail dot com
@ 2023-08-29 17:04 ` maiku.fabian at gmail dot com
  2023-08-29 17:31 ` carlos at redhat dot com
  23 siblings, 0 replies; 25+ messages in thread
From: maiku.fabian at gmail dot com @ 2023-08-29 17:04 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

Mike FABIAN <maiku.fabian at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Assignee|unassigned at sourceware dot org   |maiku.fabian at gmail dot com

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug locale/30243] Add full support for GB18030-2022
  2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
                   ` (22 preceding siblings ...)
  2023-08-29 17:04 ` maiku.fabian at gmail dot com
@ 2023-08-29 17:31 ` carlos at redhat dot com
  23 siblings, 0 replies; 25+ messages in thread
From: carlos at redhat dot com @ 2023-08-29 17:31 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=30243

Carlos O'Donell <carlos at redhat dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|---                         |2.39

--- Comment #20 from Carlos O'Donell <carlos at redhat dot com> ---
(In reply to Mike FABIAN from comment #19)
> Patch v3 pushed to git master.

Thank you!

I set Target Milestone to 2.39 so it shows up in the 2.39 NEWS bug list.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 25+ messages in thread

end of thread, other threads:[~2023-08-29 17:31 UTC | newest]

Thread overview: 25+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-03-18 17:05 [Bug locale/30243] New: GB18030-2022 is not supported! starcold14 at gmail dot com
2023-03-18 17:12 ` [Bug locale/30243] " starcold14 at gmail dot com
2023-04-19  9:44 ` starcold14 at gmail dot com
2023-05-20 22:52 ` bruno at clisp dot org
2023-05-21 11:22 ` schwab@linux-m68k.org
2023-06-16  1:28 ` liqingqing3 at huawei dot com
2023-06-16  1:54 ` bruno at clisp dot org
2023-06-16  1:54 ` bruno at clisp dot org
2023-06-16  2:01 ` bruno at clisp dot org
2023-06-25  6:21 ` lijianglin2 at huawei dot com
2023-06-27  6:15 ` lijianglin2 at huawei dot com
2023-06-28  8:45 ` lijianglin2 at huawei dot com
2023-06-28 14:33 ` liqingqing3 at huawei dot com
2023-07-03  8:09 ` lijianglin2 at huawei dot com
2023-07-26 19:14 ` jamborm at gcc dot gnu.org
2023-08-16 13:26 ` matz at suse dot de
2023-08-16 19:20 ` carlos at redhat dot com
2023-08-16 19:20 ` [Bug locale/30243] Add full support for GB18030-2022 carlos at redhat dot com
2023-08-16 20:22 ` maiku.fabian at gmail dot com
2023-08-16 20:28 ` carlos at redhat dot com
2023-08-17  1:55 ` liqingqing3 at huawei dot com
2023-08-17 10:25 ` maiku.fabian at gmail dot com
2023-08-29 17:04 ` maiku.fabian at gmail dot com
2023-08-29 17:04 ` maiku.fabian at gmail dot com
2023-08-29 17:31 ` carlos at redhat dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).