public inbox for libc-locales@sourceware.org
* [Bug localedata/20756] New: [PATCH] Use Unicode wise thousands separator
@ 2016-11-01 21:03 sbrabec at suse dot cz
  2016-11-01 23:56 ` [Bug localedata/20756] " carlos at redhat dot com
                   ` (5 more replies)
  0 siblings, 6 replies; 7+ messages in thread
From: sbrabec at suse dot cz @ 2016-11-01 21:03 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=20756

            Bug ID: 20756
           Summary: [PATCH] Use Unicode wise thousands separator
           Product: glibc
           Version: unspecified
            Status: UNCONFIRMED
          Severity: minor
          Priority: P2
         Component: localedata
          Assignee: unassigned at sourceware dot org
          Reporter: sbrabec at suse dot cz
                CC: libc-locales at sourceware dot org
  Target Milestone: ---

Created attachment 9605
  --> https://sourceware.org/bugzilla/attachment.cgi?id=9605&action=edit
Proposed changes

Many languages use a small gap as the thousands separator.

The thousands separator should not be a plain space but a narrow space. In
addition, a number must not be broken across lines when text is wrapped.

The locale data were created in the era of 8-bit encodings, so most locales use
either a space (incorrect: it allows a line break in the middle of the number)
or NBSP (better, but typographically incorrect: the space between groups is too
wide).

Now that Unicode is widely supported, we should drop the legacy characters in
favor of the correct Unicode character.

Unicode has a dedicated character for this purpose:

NNBSP
U+202F NARROW NO-BREAK SPACE: a narrow form of a no-break space, typically the
width of a thin space or a mid space
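
To illustrate the wrapping argument, here is a minimal Python sketch (it is not
part of the attached patch). The helper names `group_thousands` and
`wrap_sentence` and the sample sentence are made up for illustration. The sketch
relies on the standard `textwrap` module, which treats only ASCII whitespace as
a break opportunity, so it merely approximates what a real text layout engine
does.

```python
import textwrap

SPACE = " "        # U+0020, plain space (a line may be broken here)
NNBSP = "\u202f"   # U+202F NARROW NO-BREAK SPACE

def group_thousands(value, sep):
    """Hypothetical helper: group the digits of an integer in threes."""
    # format() inserts "," between groups of three; swap in the separator.
    return format(value, ",").replace(",", sep)

def wrap_sentence(sep, width=12):
    """Wrap a sample sentence containing a grouped number."""
    text = f"Total: {group_thousands(1234567, sep)} units"
    return textwrap.wrap(text, width=width)

with_space = wrap_sentence(SPACE)
with_nnbsp = wrap_sentence(NNBSP)
print(with_space)
print(with_nnbsp)
```

With the plain space the number is split across two lines; with U+202F the
whole number stays on one line.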

-- 
You are receiving this mail because:
You are on the CC list for the bug.


* [Bug localedata/20756] [PATCH] Use Unicode wise thousands separator
  2016-11-01 21:03 [Bug localedata/20756] New: [PATCH] Use Unicode wise thousands separator sbrabec at suse dot cz
@ 2016-11-01 23:56 ` carlos at redhat dot com
  2016-11-02 16:01 ` sbrabec at suse dot cz
                   ` (4 subsequent siblings)
  5 siblings, 0 replies; 7+ messages in thread
From: carlos at redhat dot com @ 2016-11-01 23:56 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=20756

Carlos O'Donell <carlos at redhat dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |WAITING
   Last reconfirmed|                            |2016-11-01
                 CC|                            |carlos at redhat dot com
     Ever confirmed|0                           |1

--- Comment #1 from Carlos O'Donell <carlos at redhat dot com> ---
(In reply to Stanislav Brabec from comment #0)
> Created attachment 9605 [details]
> Proposed changes
> 
> Many languages use a small gap as the thousands separator.
> 
> The thousands separator should not be a plain space but a narrow space. In
> addition, a number must not be broken across lines when text is wrapped.

Agreed.

> The locale data were created in the era of 8-bit encodings, so most locales
> use either a space (incorrect: it allows a line break in the middle of the
> number) or NBSP (better, but typographically incorrect: the space between
> groups is too wide).
> 
> Now that Unicode is widely supported, we should drop the legacy characters
> in favor of the correct Unicode character.
> 
> Unicode has a dedicated character for this purpose:
> 
> NNBSP
> U+202F NARROW NO-BREAK SPACE: a narrow form of a no-break space, typically
> the width of a thin space or a mid space

I would support this change.

The NNBSP has been around since Unicode 3.0 so we support it across the board.
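
For reference, the character's basic properties can be checked with Python's
standard `unicodedata` module. This is only an illustrative sketch and not part
of the patch; the values come from the Unicode database bundled with the
interpreter.

```python
import unicodedata

NNBSP = "\u202f"

name = unicodedata.name(NNBSP)          # official character name
category = unicodedata.category(NNBSP)  # general category (space separator)
utf8 = NNBSP.encode("utf-8")            # byte sequence used in UTF-8 locales

print(name, category, utf8.hex())
```

In UTF-8 the separator occupies three bytes (E2 80 AF), which is worth keeping
in mind for any code that assumes a single-byte thousands separator.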

Can you please post this to libc-alpha following:
https://sourceware.org/glibc/wiki/Contribution%20checklist

Note that you don't need a copyright assignment for locale data changes such as
the ones your patch proposes.


* [Bug localedata/20756] [PATCH] Use Unicode wise thousands separator
  2016-11-01 21:03 [Bug localedata/20756] New: [PATCH] Use Unicode wise thousands separator sbrabec at suse dot cz
  2016-11-01 23:56 ` [Bug localedata/20756] " carlos at redhat dot com
@ 2016-11-02 16:01 ` sbrabec at suse dot cz
  2017-08-11  2:04 ` maiku.fabian at gmail dot com
                   ` (3 subsequent siblings)
  5 siblings, 0 replies; 7+ messages in thread
From: sbrabec at suse dot cz @ 2016-11-02 16:01 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=20756

--- Comment #2 from Stanislav Brabec <sbrabec at suse dot cz> ---
Sent to libc-alpha: https://sourceware.org/ml/libc-alpha/2016-11/msg00062.html


* [Bug localedata/20756] [PATCH] Use Unicode wise thousands separator
  2016-11-01 21:03 [Bug localedata/20756] New: [PATCH] Use Unicode wise thousands separator sbrabec at suse dot cz
  2016-11-01 23:56 ` [Bug localedata/20756] " carlos at redhat dot com
  2016-11-02 16:01 ` sbrabec at suse dot cz
@ 2017-08-11  2:04 ` maiku.fabian at gmail dot com
  2017-08-13 12:05 ` cvs-commit at gcc dot gnu.org
                   ` (2 subsequent siblings)
  5 siblings, 0 replies; 7+ messages in thread
From: maiku.fabian at gmail dot com @ 2017-08-11  2:04 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=20756

Mike FABIAN <maiku.fabian at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |maiku.fabian at gmail dot com
           Assignee|unassigned at sourceware dot org   |maiku.fabian at gmail dot com


* [Bug localedata/20756] [PATCH] Use Unicode wise thousands separator
  2016-11-01 21:03 [Bug localedata/20756] New: [PATCH] Use Unicode wise thousands separator sbrabec at suse dot cz
                   ` (2 preceding siblings ...)
  2017-08-11  2:04 ` maiku.fabian at gmail dot com
@ 2017-08-13 12:05 ` cvs-commit at gcc dot gnu.org
  2017-08-13 18:52 ` maiku.fabian at gmail dot com
  2017-08-28 16:33 ` jsm28 at gcc dot gnu.org
  5 siblings, 0 replies; 7+ messages in thread
From: cvs-commit at gcc dot gnu.org @ 2017-08-13 12:05 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=20756

--- Comment #3 from cvs-commit at gcc dot gnu.org <cvs-commit at gcc dot gnu.org> ---
This is an automated email from the git hooks/post-receive script. It was
generated because a ref change was pushed to the repository containing
the project "GNU C Library master sources".

The branch, master has been updated
       via  a6bd872286b981b08577218c00e1ea693bad6095 (commit)
       via  d68b451903acfa1f8c64bb168b2cbe78beeacb3d (commit)
       via  70a6707fa15e63591d991761be025e26e8d02bb6 (commit)
      from  8b2c63e4e2ad1fd161f80004ed30624f2a37b57b (commit)

Those revisions listed above that are new to this repository have
not appeared on any other notification email; so we list those
revisions in full, below.

- Log -----------------------------------------------------------------
https://sourceware.org/git/gitweb.cgi?p=glibc.git;h=a6bd872286b981b08577218c00e1ea693bad6095

commit a6bd872286b981b08577218c00e1ea693bad6095
Author: Mike FABIAN <mfabian@redhat.com>
Date:   Sat Aug 12 17:15:25 2017 +0200

    Adapt test case data to the changes in the thousands separators

        [BZ #20756]
        * localedata/tst-langinfo.sh: Adapt test case data.
        * stdlib/tst-strfmon_l.c: Likewise.
        * stdlib/tst-strtod4.c: Likewise.
        * stdlib/tst-strtod5i.c: Likewise.

https://sourceware.org/git/gitweb.cgi?p=glibc.git;h=d68b451903acfa1f8c64bb168b2cbe78beeacb3d

commit d68b451903acfa1f8c64bb168b2cbe78beeacb3d
Author: Mike FABIAN <mfabian@redhat.com>
Date:   Sat Aug 12 15:34:36 2017 +0200

    Use U+202F instead of U+2009 as thousands_sep for es_MX

    See also [BZ #20756].

    U+202F NARROW NO-BREAK SPACE: a narrow form of a no-break space,
    typically the width of a thin space or a mid space.

    U+2009 THIN SPACE.

https://sourceware.org/git/gitweb.cgi?p=glibc.git;h=70a6707fa15e63591d991761be025e26e8d02bb6

commit 70a6707fa15e63591d991761be025e26e8d02bb6
Author: Stanislav Brabec <sbrabec@suse.cz>
Date:   Wed Nov 2 16:52:58 2016 +0100

    Locales: Use Unicode wise thousands separator

    Many languages use small gap as thousands separator.

    Thousands separator should not be a plain space, but a narrow space.
    And additionally, it is not allowed to wrap line in the middle of the
    number.

    Locale data were created in a deep age of 8-bit encodings, so most of
    them use space (incorrect: it allows wrapping the line in the middle
    of the number), or NBSP (better, but typographically incorrect: space
    between groups is too wide).

    Now UNICODE is widely supported, so we should leave legacy characters
    in favor of correct UNICODE character.

    UNICODE has a dedicated character for this purpose:

    NNBSP
    U+202F NARROW NO-BREAK SPACE: a narrow form of a no-break space,
    typically the width of a thin space or a mid space

    The NNBSP exists since Unicode 3.0.

    Use of NNBSP will prevent line wrapping in the middle of a number and
    improve readability of numbers.

        [BZ #20756]
        * locales/aa_DJ (LC_MONETARY): Replace space by NNBSP as thousands
        separator.
        * locales/az_AZ (LC_MONETARY): Likewise.
        * locales/be_BY (LC_MONETARY): Likewise.
        * locales/be_BY@latin (LC_MONETARY): Likewise.
        * locales/bg_BG (LC_MONETARY): Likewise.
        * locales/bs_BA (LC_MONETARY): Likewise.
        * locales/ce_RU (LC_MONETARY): Likewise.
        * locales/crh_UA (LC_MONETARY): Likewise.
        * locales/cs_CZ (LC_MONETARY): Likewise.
        * locales/cs_CZ (LC_NUMERIC): Likewise.
        * locales/cv_RU (LC_MONETARY): Likewise.
        * locales/de_AT (LC_MONETARY): Likewise.
        * locales/eo (LC_MONETARY): Likewise.
        * locales/es_CR (LC_MONETARY): Likewise.
        * locales/es_CR (LC_NUMERIC): Likewise.
        * locales/es_CU (LC_MONETARY): Likewise.
        * locales/et_EE (LC_MONETARY): Likewise.
        * locales/et_EE (LC_NUMERIC): Likewise.
        * locales/fi_FI (LC_MONETARY): Likewise.
        * locales/fi_FI (LC_NUMERIC): Likewise.
        * locales/fr_CA (LC_MONETARY): Likewise.
        * locales/fr_FR (LC_MONETARY): Likewise.
        * locales/fr_FR (LC_NUMERIC): Likewise.
        * locales/fr_LU (LC_MONETARY): Likewise.
        * locales/fr_LU (LC_NUMERIC): Likewise.
        * locales/hr_HR (LC_MONETARY): Likewise.
        * locales/ht_HT (LC_NUMERIC): Likewise.
        * locales/kk_KZ (LC_MONETARY): Likewise.
        * locales/kk_KZ (LC_NUMERIC): Likewise.
        * locales/ky_KG (LC_MONETARY): Likewise.
        * locales/ky_KG (LC_NUMERIC): Likewise.
        * locales/lv_LV (LC_MONETARY): Likewise.
        * locales/lv_LV (LC_NUMERIC): Likewise.
        * locales/mg_MG (LC_MONETARY): Likewise.
        * locales/mhr_RU (LC_MONETARY): Likewise.
        * locales/mk_MK (LC_MONETARY): Likewise.
        * locales/mk_MK (LC_NUMERIC): Likewise.
        * locales/mn_MN (LC_MONETARY): Likewise.
        * locales/nb_NO (LC_MONETARY): Likewise.
        * locales/nb_NO (LC_NUMERIC): Likewise.
        * locales/nl_AW (LC_MONETARY): Likewise.
        * locales/nl_NL (LC_MONETARY): Likewise.
        * locales/nn_NO (LC_MONETARY): Likewise.
        * locales/os_RU (LC_MONETARY): Likewise.
        * locales/pap_AW (LC_MONETARY): Likewise.
        * locales/pap_CW (LC_MONETARY): Likewise.
        * locales/ru_RU (LC_MONETARY): Likewise.
        * locales/ru_RU (LC_NUMERIC): Likewise.
        * locales/ru_UA (LC_MONETARY): Likewise.
        * locales/sk_SK (LC_MONETARY): Likewise.
        * locales/sk_SK (LC_NUMERIC): Likewise.
        * locales/sl_SI (LC_MONETARY): Likewise.
        * locales/sl_SI (LC_NUMERIC): Likewise.
        * locales/sq_MK (LC_MONETARY): Likewise.
        * locales/sv_SE (LC_MONETARY): Likewise.
        * locales/sv_SE (LC_NUMERIC): Likewise.
        * locales/tg_TJ (LC_MONETARY): Likewise.
        * locales/tt_RU (LC_MONETARY): Likewise.
        * locales/tt_RU@iqtelif (LC_MONETARY): Likewise.
        * locales/uk_UA (LC_MONETARY): Likewise.
        * locales/uk_UA (LC_NUMERIC): Likewise.
        * locales/unm_US (LC_MONETARY): Likewise.
        * locales/unm_US (LC_NUMERIC): Likewise.
        * locales/wo_SN (LC_MONETARY): Likewise.

-----------------------------------------------------------------------

Summary of changes:
 ChangeLog                        |   11 ++++++
 localedata/ChangeLog             |   72 ++++++++++++++++++++++++++++++++++++++
 localedata/locales/aa_DJ         |    2 +-
 localedata/locales/az_AZ         |    2 +-
 localedata/locales/be_BY         |    2 +-
 localedata/locales/be_BY@latin   |    2 +-
 localedata/locales/bg_BG         |    2 +-
 localedata/locales/bs_BA         |    2 +-
 localedata/locales/ce_RU         |    2 +-
 localedata/locales/crh_UA        |    2 +-
 localedata/locales/cs_CZ         |    4 +-
 localedata/locales/cv_RU         |    2 +-
 localedata/locales/de_AT         |    2 +-
 localedata/locales/eo            |    2 +-
 localedata/locales/es_CR         |    4 +-
 localedata/locales/es_CU         |    2 +-
 localedata/locales/es_MX         |    2 +-
 localedata/locales/et_EE         |    4 +-
 localedata/locales/fi_FI         |    4 +-
 localedata/locales/fr_CA         |    2 +-
 localedata/locales/fr_FR         |    4 +-
 localedata/locales/fr_LU         |    4 +-
 localedata/locales/hr_HR         |    2 +-
 localedata/locales/ht_HT         |    4 +-
 localedata/locales/kk_KZ         |    4 +-
 localedata/locales/ky_KG         |    4 +-
 localedata/locales/lv_LV         |    4 +-
 localedata/locales/mg_MG         |    2 +-
 localedata/locales/mhr_RU        |    2 +-
 localedata/locales/mk_MK         |    4 +-
 localedata/locales/mn_MN         |    2 +-
 localedata/locales/nb_NO         |    4 +-
 localedata/locales/nl_AW         |    2 +-
 localedata/locales/nl_NL         |    2 +-
 localedata/locales/nn_NO         |    2 +-
 localedata/locales/os_RU         |    2 +-
 localedata/locales/pap_AW        |    2 +-
 localedata/locales/pap_CW        |    2 +-
 localedata/locales/ru_RU         |    4 +-
 localedata/locales/ru_UA         |    2 +-
 localedata/locales/sk_SK         |    4 +-
 localedata/locales/sl_SI         |    4 +-
 localedata/locales/sq_MK         |    2 +-
 localedata/locales/sv_SE         |    4 +-
 localedata/locales/tg_TJ         |    2 +-
 localedata/locales/tt_RU         |    2 +-
 localedata/locales/tt_RU@iqtelif |    2 +-
 localedata/locales/uk_UA         |    4 +-
 localedata/locales/unm_US        |    4 +-
 localedata/locales/wo_SN         |    2 +-
 localedata/tst-langinfo.sh       |    2 +-
 stdlib/tst-strfmon_l.c           |    8 ++--
 stdlib/tst-strtod4.c             |    6 ++--
 stdlib/tst-strtod5i.c            |    6 ++--
 54 files changed, 160 insertions(+), 77 deletions(-)


* [Bug localedata/20756] [PATCH] Use Unicode wise thousands separator
  2016-11-01 21:03 [Bug localedata/20756] New: [PATCH] Use Unicode wise thousands separator sbrabec at suse dot cz
                   ` (3 preceding siblings ...)
  2017-08-13 12:05 ` cvs-commit at gcc dot gnu.org
@ 2017-08-13 18:52 ` maiku.fabian at gmail dot com
  2017-08-28 16:33 ` jsm28 at gcc dot gnu.org
  5 siblings, 0 replies; 7+ messages in thread
From: maiku.fabian at gmail dot com @ 2017-08-13 18:52 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=20756

Mike FABIAN <maiku.fabian at gmail dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|WAITING                     |RESOLVED
         Resolution|---                         |FIXED

--- Comment #4 from Mike FABIAN <maiku.fabian at gmail dot com> ---
FIXED.


* [Bug localedata/20756] [PATCH] Use Unicode wise thousands separator
  2016-11-01 21:03 [Bug localedata/20756] New: [PATCH] Use Unicode wise thousands separator sbrabec at suse dot cz
                   ` (4 preceding siblings ...)
  2017-08-13 18:52 ` maiku.fabian at gmail dot com
@ 2017-08-28 16:33 ` jsm28 at gcc dot gnu.org
  5 siblings, 0 replies; 7+ messages in thread
From: jsm28 at gcc dot gnu.org @ 2017-08-28 16:33 UTC (permalink / raw)
  To: libc-locales

https://sourceware.org/bugzilla/show_bug.cgi?id=20756

Joseph Myers <jsm28 at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|WAITING                     |RESOLVED
         Resolution|---                         |FIXED
   Target Milestone|---                         |2.27

--- Comment #3 from Joseph Myers <jsm28 at gcc dot gnu.org> ---
Restoring changes lost in system crash and restore from backup.

https://sourceware.org/ml/glibc-bugs/2017-08/msg00358.html
https://sourceware.org/ml/glibc-bugs/2017-08/msg00359.html

