From: "maiku.fabian at gmail dot com"
To: libc-locales@sourceware.org
Subject: [Bug localedata/26120] column width of of some Korean JUNGSEONG/JONGSEONG characters wrong (should be 0)
Date: Sun, 21 Jun 2020 09:07:02 +0000

https://sourceware.org/bugzilla/show_bug.cgi?id=26120

--- Comment #9 from Mike FABIAN ---
(In reply to Thorsten Glaser from comment #7)
> Erk… glibc is particular about not defining widths of not-defined characters.
>
> Besides D7FC‥D7FF (which gave me an error in the output from my own
> scripts), D7C7‥D7CA are not yet assigned and so probably need to be excluded
> in glibc.
>
> Should they ever be defined, we’ll need to adjust here, so it’s probably
> better to iterate over the entire D7C0‥D7FF range and only change widths for
> defined codepoints from the current UCD version.

Thank you for noticing that!

I was aware that glibc has a problem with defining the width of unassigned
characters, so I used

    for key in list(range(0xD7B0, 0xD7FC)):

instead of

    for key in list(range(0xD7B0, 0xD800)):

because D7FC‥D7FF are unassigned and localedef gave me errors when I
included them.

Surprisingly, localedef did not give errors for the unassigned D7C7‥D7CA ...

I had checked the range manually and thought all characters from D7B0 to D7FB
were assigned, but apparently I missed D7C7‥D7CA.

I improved the generator script a bit to omit unassigned characters. If
these are assigned in the future, the script will add them.
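
The idea of omitting unassigned code points can be sketched roughly as follows. This is only an illustration, not the code of the generator script: it assumes Python's `unicodedata` module as a stand-in for the UCD files, so the result depends on the Unicode version bundled with the Python interpreter, and `zero_width_keys` is a hypothetical name.

```python
import unicodedata


def zero_width_keys(start=0xD7B0, stop=0xD800):
    """Return the assigned code points in [start, stop).

    Hypothetical helper: uses Python's bundled Unicode database
    instead of parsing the UCD files directly.
    """
    keys = []
    for key in range(start, stop):
        # General category 'Cn' marks unassigned code points,
        # e.g. D7C7..D7CA and D7FC..D7FF in this block.
        if unicodedata.category(chr(key)) == 'Cn':
            continue
        keys.append(key)
    return keys


if __name__ == '__main__':
    print(['{:04X}'.format(k) for k in zero_width_keys()])
```

Iterating over the whole block and filtering by assignment means that code points assigned in a later Unicode version are picked up without editing the range by hand.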