public inbox for libc-locales@sourceware.org
 help / color / mirror / Atom feed
* [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0
@ 2017-09-03 21:01 vapier at gentoo dot org
  2017-09-04  8:10 ` [Bug localedata/22074] " tjk at tksoft dot com
                   ` (11 more replies)
  0 siblings, 12 replies; 14+ messages in thread
From: vapier at gentoo dot org @ 2017-09-03 21:01 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=22074

            Bug ID: 22074
           Summary: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul
                    Jungseong and Jongseong) should be 0
           Product: glibc
           Version: 2.26
            Status: NEW
          Severity: normal
          Priority: P2
         Component: localedata
          Assignee: unassigned at sourceware dot org
          Reporter: vapier at gentoo dot org
                CC: egmont at gmail dot com, libc-locales at sourceware dot org,
                    maiku.fabian at gmail dot com, tg at mirbsd dot de
        Depends on: 21750
  Target Milestone: ---

+++ This bug was initially created as a clone of Bug #21750 +++

I’ve compared the new autogenerated column width from
localedata/unicode-gen/utf8_gen.py with the results of the classical wcwidth()
implementation from xterm (adjusted to Unicode 10.0.0) and found a few
divergences (and bugs on my (MirBSD, which uses something based on xterm’s data
system-wide) side, which I fixed).

Hangul Jamo medial vowels and final consonants are set to 0 by xterm so they
combine on top of the preceding initial ones: U+1160‥U+11FF

Change Request to glibc: force U+1160‥U+11FF to width 0.

Markus Kuhn's implementation of wcwidth does this explicitly:
  https://www.cl.cam.ac.uk/~mgk25/ucs/wcwidth.c
  /* generated by "uniset +cat=Me +cat=Mn +cat=Cf -00AD +1160-11FF +200B c" */

Unicode 10.0.0 chapter 18 section 6 page 713 [1] states:
The Hangul Jamo block contains the most frequently used conjoining jamo. These
include all of the jamo used in modern Hangul syllable blocks, as well as many
of the jamo for Old Korean. The Hangul jamo are divided into three classes:
choseong (leading consonants, or syllable-initial characters), jungseong
(vowels, or syllable-peak characters), and jongseong (trailing consonants, or
syllable-final characters). Each class may, in turn, consist of one to three
subunits. For example, a choseong syllable-initial character may either
represent a single consonant sound, or a consonant cluster consisting of two or
three consonant sounds. Likewise, a jungseong syllable-peak character may
represent a simple vowel sound, or a complex diphthong or triphthong with
onglide or offglide sounds. Each of these complex sequences of two or three
sounds is encoded as a single conjoining jamo character. Therefore, a complete
Hangul syllable can always be conceived of as a single choseong followed by a
single jungseong and (optionally) a single jongseong. This block also contains
two invisible filler characters which act as placeholders for a missing
choseong or jungseong in an incomplete syllable. These filler characters are
U+115F hangul choseong filler and U+1160 hangul jungseong filler.

[1] http://www.unicode.org/versions/Unicode10.0.0/ch18.pdf

a concrete example, these three codepoints (one from each class):
  ᅔ U+1154 Hangul Choseong Chitueumchieuch
  ᅦ U+1166 Hangul Jungseong E
  ᇯ U+11ef Hangul Jongseong Ieung-Khieukh

will form a single grapheme that should be wcwidth of 1:
  ᅔᅦᇯ

the tricky part is that Jungseong & Jongseong conjoin only when they follow a
Choseong.  so if you had a space U+0020 between each of those codepoints, you'd
end up with wcwidth of 5.


Referenced Bugs:

https://sourceware.org/bugzilla/show_bug.cgi?id=21750
[Bug 21750] column width of characters incompatible with classical wcwidth
-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [Bug localedata/22074] charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0
  2017-09-03 21:01 [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0 vapier at gentoo dot org
@ 2017-09-04  8:10 ` tjk at tksoft dot com
  2017-09-04  8:10 ` maiku.fabian at gmail dot com
                   ` (10 subsequent siblings)
  11 siblings, 0 replies; 14+ messages in thread
From: tjk at tksoft dot com @ 2017-09-04  8:10 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=22074

--- Comment #2 from Troy Korjuslommi <tjk at tksoft dot com> ---
Aren't Korean chars usually full width? I.e. wcwidth 2.

Troy



On Sun, 2017-09-03 at 21:01 +0000, vapier at gentoo dot org wrote:
> https://sourceware.org/bugzilla/show_bug.cgi?id=22074
> 
>             Bug ID: 22074
>            Summary: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul
>                     Jungseong and Jongseong) should be 0
>            Product: glibc
>            Version: 2.26
>             Status: NEW
>           Severity: normal
>           Priority: P2
>          Component: localedata
>           Assignee: unassigned at sourceware dot org
>           Reporter: vapier at gentoo dot org
>                 CC: egmont at gmail dot com, libc-locales at sourceware dot org,
>                     maiku.fabian at gmail dot com, tg at mirbsd dot de
>         Depends on: 21750
>   Target Milestone: ---
> 
> +++ This bug was initially created as a clone of Bug #21750 +++
> 
> I’ve compared the new autogenerated column width from
> localedata/unicode-gen/utf8_gen.py with the results of the classical wcwidth()
> implementation from xterm (adjusted to Unicode 10.0.0) and found a few
> divergences (and bugs on my (MirBSD, which uses something based on xterm’s data
> system-wide) side, which I fixed).
> 
> Hangul Jamo medial vowels and final consonants are set to 0 by xterm so they
> combine on top of the preceding initial ones: U+1160‥U+11FF
> 
> Change Request to glibc: force U+1160‥U+11FF to width 0.
> 
> Markus Kuhn's implementation of wcwidth does this explicitly:
>   https://www.cl.cam.ac.uk/~mgk25/ucs/wcwidth.c
>   /* generated by "uniset +cat=Me +cat=Mn +cat=Cf -00AD +1160-11FF +200B c" */
> 
> Unicode 10.0.0 chapter 18 section 6 page 713 [1] states:
> The Hangul Jamo block contains the most frequently used conjoining jamo. These
> include all of the jamo used in modern Hangul syllable blocks, as well as many
> of the jamo for Old Korean. The Hangul jamo are divided into three classes:
> choseong (leading consonants, or syllable-initial characters), jungseong
> (vowels, or syllable-peak characters), and jongseong (trailing consonants, or
> syllable-final characters). Each class may, in turn, consist of one to three
> subunits. For example, a choseong syllable-initial character may either
> represent a single consonant sound, or a consonant cluster consisting of two or
> three consonant sounds. Likewise, a jungseong syllable-peak character may
> represent a simple vowel sound, or a complex diphthong or triphthong with
> onglide or offglide sounds. Each of these complex sequences of two or three
> sounds is encoded as a single conjoining jamo character. Therefore, a complete
> Hangul syllable can always be conceived of as a single choseong followed by a
> single jungseong and (optionally) a single jongseong. This block also contains
> two invisible filler characters which act as placeholders for a missing
> choseong or jungseong in an incomplete syllable. These filler characters are
> U+115F hangul choseong filler and U+1160 hangul jungseong filler.
> 
> [1] http://www.unicode.org/versions/Unicode10.0.0/ch18.pdf
> 
> a concrete example, these three codepoints (one from each class):
>   ᅔ U+1154 Hangul Choseong Chitueumchieuch
>   ᅦ U+1166 Hangul Jungseong E
>   ᇯ U+11ef Hangul Jongseong Ieung-Khieukh
> 
> will form a single grapheme that should be wcwidth of 1:
>   ᅔᅦᇯ
> 
> the tricky part is that Jungseong & Jongseong conjoin only when they follow a
> Choseong.  so if you had a space U+0020 between each of those codepoints, you'd
> end up with wcwidth of 5.
> 
> 
> Referenced Bugs:
> 
> https://sourceware.org/bugzilla/show_bug.cgi?id=21750
> [Bug 21750] column width of characters incompatible with classical wcwidth

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0
  2017-09-03 21:01 [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0 vapier at gentoo dot org
  2017-09-04  8:10 ` [Bug localedata/22074] " tjk at tksoft dot com
  2017-09-04  8:10 ` maiku.fabian at gmail dot com
@ 2017-09-04  8:10 ` Troy Korjuslommi
  2017-09-04  8:16 ` [Bug localedata/22074] " maiku.fabian at gmail dot com
                   ` (8 subsequent siblings)
  11 siblings, 0 replies; 14+ messages in thread
From: Troy Korjuslommi @ 2017-09-04  8:10 UTC (permalink / raw)
  To: vapier at gentoo dot org; +Cc: libc-locales

Aren't Korean chars usually full width? I.e. wcwidth 2.

Troy



On Sun, 2017-09-03 at 21:01 +0000, vapier at gentoo dot org wrote:
> https://sourceware.org/bugzilla/show_bug.cgi?id=22074
> 
>             Bug ID: 22074
>            Summary: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul
>                     Jungseong and Jongseong) should be 0
>            Product: glibc
>            Version: 2.26
>             Status: NEW
>           Severity: normal
>           Priority: P2
>          Component: localedata
>           Assignee: unassigned at sourceware dot org
>           Reporter: vapier at gentoo dot org
>                 CC: egmont at gmail dot com, libc-locales at sourceware dot org,
>                     maiku.fabian at gmail dot com, tg at mirbsd dot de
>         Depends on: 21750
>   Target Milestone: ---
> 
> +++ This bug was initially created as a clone of Bug #21750 +++
> 
> I’ve compared the new autogenerated column width from
> localedata/unicode-gen/utf8_gen.py with the results of the classical wcwidth()
> implementation from xterm (adjusted to Unicode 10.0.0) and found a few
> divergences (and bugs on my (MirBSD, which uses something based on xterm’s data
> system-wide) side, which I fixed).
> 
> Hangul Jamo medial vowels and final consonants are set to 0 by xterm so they
> combine on top of the preceding initial ones: U+1160‥U+11FF
> 
> Change Request to glibc: force U+1160‥U+11FF to width 0.
> 
> Markus Kuhn's implementation of wcwidth does this explicitly:
>   https://www.cl.cam.ac.uk/~mgk25/ucs/wcwidth.c
>   /* generated by "uniset +cat=Me +cat=Mn +cat=Cf -00AD +1160-11FF +200B c" */
> 
> Unicode 10.0.0 chapter 18 section 6 page 713 [1] states:
> The Hangul Jamo block contains the most frequently used conjoining jamo. These
> include all of the jamo used in modern Hangul syllable blocks, as well as many
> of the jamo for Old Korean. The Hangul jamo are divided into three classes:
> choseong (leading consonants, or syllable-initial characters), jungseong
> (vowels, or syllable-peak characters), and jongseong (trailing consonants, or
> syllable-final characters). Each class may, in turn, consist of one to three
> subunits. For example, a choseong syllable-initial character may either
> represent a single consonant sound, or a consonant cluster consisting of two or
> three consonant sounds. Likewise, a jungseong syllable-peak character may
> represent a simple vowel sound, or a complex diphthong or triphthong with
> onglide or offglide sounds. Each of these complex sequences of two or three
> sounds is encoded as a single conjoining jamo character. Therefore, a complete
> Hangul syllable can always be conceived of as a single choseong followed by a
> single jungseong and (optionally) a single jongseong. This block also contains
> two invisible filler characters which act as placeholders for a missing
> choseong or jungseong in an incomplete syllable. These filler characters are
> U+115F hangul choseong filler and U+1160 hangul jungseong filler.
> 
> [1] http://www.unicode.org/versions/Unicode10.0.0/ch18.pdf
> 
> a concrete example, these three codepoints (one from each class):
>   á…” U+1154 Hangul Choseong Chitueumchieuch
>   á…¦ U+1166 Hangul Jungseong E
>   ᇯ U+11ef Hangul Jongseong Ieung-Khieukh
> 
> will form a single grapheme that should be wcwidth of 1:
>   ᅔᅦᇯ
> 
> the tricky part is that Jungseong & Jongseong conjoin only when they follow a
> Choseong.  so if you had a space U+0020 between each of those codepoints, you'd
> end up with wcwidth of 5.
> 
> 
> Referenced Bugs:
> 
> https://sourceware.org/bugzilla/show_bug.cgi?id=21750
> [Bug 21750] column width of characters incompatible with classical wcwidth



^ permalink raw reply	[flat|nested] 14+ messages in thread

* [Bug localedata/22074] charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0
  2017-09-03 21:01 [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0 vapier at gentoo dot org
  2017-09-04  8:10 ` [Bug localedata/22074] " tjk at tksoft dot com
@ 2017-09-04  8:10 ` maiku.fabian at gmail dot com
  2017-09-04  8:10 ` [Bug localedata/22074] New: " Troy Korjuslommi
                   ` (9 subsequent siblings)
  11 siblings, 0 replies; 14+ messages in thread
From: maiku.fabian at gmail dot com @ 2017-09-04  8:10 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=22074

--- Comment #1 from Mike FABIAN <maiku.fabian at gmail dot com> ---
(In reply to Mike Frysinger from comment #0)
> 
> the tricky part is that Jungseong & Jongseong conjoin only when they follow
> a Choseong.  so if you had a space U+0020 between each of those codepoints,
> you'd end up with wcwidth of 5.

But we cannot really do anything about the context, so we have to decide
for one width to use.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [Bug localedata/22074] charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0
  2017-09-03 21:01 [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0 vapier at gentoo dot org
                   ` (2 preceding siblings ...)
  2017-09-04  8:10 ` [Bug localedata/22074] New: " Troy Korjuslommi
@ 2017-09-04  8:16 ` maiku.fabian at gmail dot com
  2017-09-04  9:51 ` maiku.fabian at gmail dot com
                   ` (7 subsequent siblings)
  11 siblings, 0 replies; 14+ messages in thread
From: maiku.fabian at gmail dot com @ 2017-09-04  8:16 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=22074

Mike FABIAN <maiku.fabian at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Assignee|unassigned at sourceware dot org   |maiku.fabian at gmail dot com

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [Bug localedata/22074] charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0
  2017-09-03 21:01 [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0 vapier at gentoo dot org
                   ` (3 preceding siblings ...)
  2017-09-04  8:16 ` [Bug localedata/22074] " maiku.fabian at gmail dot com
@ 2017-09-04  9:51 ` maiku.fabian at gmail dot com
  2017-09-11 11:25   ` Troy Korjuslommi
  2017-09-05 14:55 ` vapier at gentoo dot org
                   ` (6 subsequent siblings)
  11 siblings, 1 reply; 14+ messages in thread
From: maiku.fabian at gmail dot com @ 2017-09-04  9:51 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=22074

--- Comment #3 from Mike FABIAN <maiku.fabian at gmail dot com> ---
(In reply to Troy Korjuslommi from comment #2)
> Aren't Korean chars usually full width? I.e. wcwidth 2.
> 
> Troy

Yes, but the characters we are discussing here are *parts*
of Korean characters, not *whole* Korean characters.

See Mike Frysinger’s example in comment#0.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [Bug localedata/22074] charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0
  2017-09-03 21:01 [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0 vapier at gentoo dot org
                   ` (4 preceding siblings ...)
  2017-09-04  9:51 ` maiku.fabian at gmail dot com
@ 2017-09-05 14:55 ` vapier at gentoo dot org
  2017-09-06  9:30 ` maiku.fabian at gmail dot com
                   ` (5 subsequent siblings)
  11 siblings, 0 replies; 14+ messages in thread
From: vapier at gentoo dot org @ 2017-09-05 14:55 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=22074

--- Comment #4 from Mike Frysinger <vapier at gentoo dot org> ---
(In reply to Mike FABIAN from comment #1)
> But we cannot really do anything about the context, so we have to decide
> for one width to use.

today that is true.  long term, i think we'll need to figure out something
better similar to UAX29.  it wouldn't help with wcwidth, but it would for
wcswidth.
  http://www.unicode.org/reports/tr29/

so i think for today, supporting the common case and not the degenerate/invalid
cases makes sense.  which is to say, marking them as wcwidth of 0.

(In reply to Troy Korjuslommi from comment #2)
> Aren't Korean chars usually full width? I.e. wcwidth 2.

the precomposed hangul forms are wcwidth of 2:
  https://en.wikipedia.org/wiki/Hangul_Syllables

so i guess not only would we want to change the two sets to 0, we'd want to
change the first set to 2.  that way we'd line up with the precomposed forms
better.

i think that should get us closer, but it'd still be an approximation.  maybe
we should just focus on wcswidth here ?

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [Bug localedata/22074] charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0
  2017-09-03 21:01 [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0 vapier at gentoo dot org
                   ` (5 preceding siblings ...)
  2017-09-05 14:55 ` vapier at gentoo dot org
@ 2017-09-06  9:30 ` maiku.fabian at gmail dot com
  2017-09-06 14:38 ` maiku.fabian at gmail dot com
                   ` (4 subsequent siblings)
  11 siblings, 0 replies; 14+ messages in thread
From: maiku.fabian at gmail dot com @ 2017-09-06  9:30 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=22074

--- Comment #5 from Mike FABIAN <maiku.fabian at gmail dot com> ---
(In reply to Mike Frysinger from comment #4)
> (In reply to Mike FABIAN from comment #1)

> (In reply to Troy Korjuslommi from comment #2)
> > Aren't Korean chars usually full width? I.e. wcwidth 2.
> 
> the precomposed hangul forms are wcwidth of 2:
>   https://en.wikipedia.org/wiki/Hangul_Syllables
> 
> so i guess not only would we want to change the two sets to 0, we'd want to
> change the first set to 2.  that way we'd line up with the precomposed forms
> better.

I think we already set the precomposed hangul to width 2 because of this line

AC00..D7A3;W     # Lo [11172] HANGUL SYLLABLE GA..HANGUL SYLLABLE HIH

in EastAsianWidth.txt

we have

<UAC00>...<UD7A3>       2

in charmaps/UTF-8 in the WIDTH section.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [Bug localedata/22074] charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0
  2017-09-03 21:01 [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0 vapier at gentoo dot org
                   ` (6 preceding siblings ...)
  2017-09-06  9:30 ` maiku.fabian at gmail dot com
@ 2017-09-06 14:38 ` maiku.fabian at gmail dot com
  2017-09-11 15:44 ` tjk at tksoft dot com
                   ` (3 subsequent siblings)
  11 siblings, 0 replies; 14+ messages in thread
From: maiku.fabian at gmail dot com @ 2017-09-06 14:38 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=22074

--- Comment #6 from Mike FABIAN <maiku.fabian at gmail dot com> ---
So this bug is fixed, isn’t it?

Because in glibc master in charmaps/UTF-8, we have the
precomposed hangul with width 2 and the hangul jamo with width 0.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [Bug localedata/22074] charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0
  2017-09-04  9:51 ` maiku.fabian at gmail dot com
@ 2017-09-11 11:25   ` Troy Korjuslommi
  0 siblings, 0 replies; 14+ messages in thread
From: Troy Korjuslommi @ 2017-09-11 11:25 UTC (permalink / raw)
  To: maiku.fabian at gmail dot com; +Cc: libc-locales

I was referring to the "should be wcwidth of 1" comment, which doesn't
seem to be correct. My point was that the wcwidth should  be either 0 or
two. 

Troy


On Mon, 2017-09-04 at 08:54 +0000, maiku.fabian at gmail dot com wrote:
> https://sourceware.org/bugzilla/show_bug.cgi?id=22074
> 
> --- Comment #3 from Mike FABIAN <maiku.fabian at gmail dot com> ---
> (In reply to Troy Korjuslommi from comment #2)
> > Aren't Korean chars usually full width? I.e. wcwidth 2.
> > 
> > Troy
> 
> Yes, but the characters we are discussing here are *parts*
> of Korean characters, not *whole* Korean characters.
> 
> See Mike Frysinger’s example in comment#0.
> 


^ permalink raw reply	[flat|nested] 14+ messages in thread

* [Bug localedata/22074] charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0
  2017-09-03 21:01 [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0 vapier at gentoo dot org
                   ` (7 preceding siblings ...)
  2017-09-06 14:38 ` maiku.fabian at gmail dot com
@ 2017-09-11 15:44 ` tjk at tksoft dot com
  2017-09-11 20:09 ` maiku.fabian at gmail dot com
                   ` (2 subsequent siblings)
  11 siblings, 0 replies; 14+ messages in thread
From: tjk at tksoft dot com @ 2017-09-11 15:44 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=22074

--- Comment #7 from Troy Korjuslommi <tjk at tksoft dot com> ---
I was referring to the "should be wcwidth of 1" comment, which doesn't
seem to be correct. My point was that the wcwidth should  be either 0 or
two. 

Troy


On Mon, 2017-09-04 at 08:54 +0000, maiku.fabian at gmail dot com wrote:
> https://sourceware.org/bugzilla/show_bug.cgi?id=22074
> 
> --- Comment #3 from Mike FABIAN <maiku.fabian at gmail dot com> ---
> (In reply to Troy Korjuslommi from comment #2)
> > Aren't Korean chars usually full width? I.e. wcwidth 2.
> > 
> > Troy
> 
> Yes, but the characters we are discussing here are *parts*
> of Korean characters, not *whole* Korean characters.
> 
> See Mike Frysinger’s example in comment#0.
>

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [Bug localedata/22074] charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0
  2017-09-03 21:01 [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0 vapier at gentoo dot org
                   ` (8 preceding siblings ...)
  2017-09-11 15:44 ` tjk at tksoft dot com
@ 2017-09-11 20:09 ` maiku.fabian at gmail dot com
  2017-09-12 16:30 ` maiku.fabian at gmail dot com
  2017-09-14 14:11 ` maiku.fabian at gmail dot com
  11 siblings, 0 replies; 14+ messages in thread
From: maiku.fabian at gmail dot com @ 2017-09-11 20:09 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=22074

Mike FABIAN <maiku.fabian at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|---                         |FIXED

--- Comment #8 from Mike FABIAN <maiku.fabian at gmail dot com> ---
I think this is fixed:

https://sourceware.org/bugzilla/show_bug.cgi?id=22074#c6

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [Bug localedata/22074] charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0
  2017-09-03 21:01 [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0 vapier at gentoo dot org
                   ` (9 preceding siblings ...)
  2017-09-11 20:09 ` maiku.fabian at gmail dot com
@ 2017-09-12 16:30 ` maiku.fabian at gmail dot com
  2017-09-14 14:11 ` maiku.fabian at gmail dot com
  11 siblings, 0 replies; 14+ messages in thread
From: maiku.fabian at gmail dot com @ 2017-09-12 16:30 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=22074

Mike FABIAN <maiku.fabian at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
   Target Milestone|---                         |2.27

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [Bug localedata/22074] charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0
  2017-09-03 21:01 [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0 vapier at gentoo dot org
                   ` (10 preceding siblings ...)
  2017-09-12 16:30 ` maiku.fabian at gmail dot com
@ 2017-09-14 14:11 ` maiku.fabian at gmail dot com
  11 siblings, 0 replies; 14+ messages in thread
From: maiku.fabian at gmail dot com @ 2017-09-14 14:11 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=22074
Bug 22074 depends on bug 21750, which changed state.

Bug 21750 Summary: column width of characters incompatible with classical wcwidth
https://sourceware.org/bugzilla/show_bug.cgi?id=21750

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|REOPENED                    |RESOLVED
         Resolution|---                         |FIXED

-- 
You are receiving this mail because:
You are on the CC list for the bug.

^ permalink raw reply	[flat|nested] 14+ messages in thread

end of thread, other threads:[~2017-09-14 13:45 UTC | newest]

Thread overview: 14+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-09-03 21:01 [Bug localedata/22074] New: charmaps/UTF-8: wcwidth for U+1160-U+11FF (Hangul Jungseong and Jongseong) should be 0 vapier at gentoo dot org
2017-09-04  8:10 ` [Bug localedata/22074] " tjk at tksoft dot com
2017-09-04  8:10 ` maiku.fabian at gmail dot com
2017-09-04  8:10 ` [Bug localedata/22074] New: " Troy Korjuslommi
2017-09-04  8:16 ` [Bug localedata/22074] " maiku.fabian at gmail dot com
2017-09-04  9:51 ` maiku.fabian at gmail dot com
2017-09-11 11:25   ` Troy Korjuslommi
2017-09-05 14:55 ` vapier at gentoo dot org
2017-09-06  9:30 ` maiku.fabian at gmail dot com
2017-09-06 14:38 ` maiku.fabian at gmail dot com
2017-09-11 15:44 ` tjk at tksoft dot com
2017-09-11 20:09 ` maiku.fabian at gmail dot com
2017-09-12 16:30 ` maiku.fabian at gmail dot com
2017-09-14 14:11 ` maiku.fabian at gmail dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).