public inbox for glibc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug localedata/13237] New: country_name field of LC_ADDRESS
@ 2011-09-30  6:34 cjlhomeaddress at gmail dot com
  2011-10-11  8:04 ` [Bug localedata/13237] " claude at 2xlibre dot net
                   ` (20 more replies)
  0 siblings, 21 replies; 22+ messages in thread
From: cjlhomeaddress at gmail dot com @ 2011-09-30  6:34 UTC (permalink / raw)
  To: glibc-bugs

http://sourceware.org/bugzilla/show_bug.cgi?id=13237

             Bug #: 13237
           Summary: country_name field of LC_ADDRESS
           Product: glibc
           Version: 2.14
            Status: NEW
          Severity: normal
          Priority: P2
         Component: localedata
        AssignedTo: libc-locales@sources.redhat.com
        ReportedBy: cjlhomeaddress@gmail.com
    Classification: Unclassified


Created attachment 5954
  --> http://sourceware.org/bugzilla/attachment.cgi?id=5954
Summary of glibc country_name field entries.

I have performed a comprehensive analysis of the use of the LC_ADDRESS field
for country_name.  I am somewhat concerned by the findings of that analysis for
a field that should be populated with the name of the country in the language
of the locale, two pieces of information inherent in the locale name.


There are 279 locales (excluding the deprecated iw_IL).

Of those 279, only 84 locales have populated country_name fields. 



84 populated

43 empty, (not readily determined)

152 empty, but can be easily determined by look-up in ISO-3166 L10n files.

equals 279 total


Of the 84 populated country_name fields:

37 can be confirmed from ISO-3166 L10n files.

31 cannot be confirmed from ISO-3166 L10n files (not necessarily a problem).

16 have obvious encoding errors or require review and / or correction.


Examples of errors:

km_KH encodes Lao characters spelling Laos, not Khmer characters spelling
Cambodia.

bg_BG, ku_TR, mk_MK, mn_MN, tr_TR encode English, not native language/script
names

bo_CN and bo_IN coded as FIXME, should be commented out.

dz_BT coded as BHU

ur_IN uses "copy hi_IN", thus encoding Localein Hindi, not Urdu language name
of India.

en-US encodes USA (not United States)
es-US encodes USA (not Estados Unidos)

Others include conflicts with ISO-3166 entries that require clarification.

Some consideration should be given to correcting the obvious errors and making
the easily confirmed additions so that the LC_ADDRESS country_name field is
more usefully populated with the country name of the locale in the language of
the locale.


The first column attached spreadsheet contains links to 2xlibre.net locale
files (purely for convenience), This data had been recently refreshed from 2.14
release.

All details checked against original sources at:
http://sourceware.org/git/?p=glibc.git;a=tree;f=localedata/locales;h=aa17c365ce474cfb9c7dab92b623bfb5a8786208;hb=HEAD

The key columns are the "Action" (suggested) and the "Corrected country_name"
column.  The entries in the "Evidence ISO-3166" column link directly to the
relevant location within the PO files.

-- 
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


^ permalink raw reply	[flat|nested] 22+ messages in thread

end of thread, other threads:[~2014-06-27 12:00 UTC | newest]

Thread overview: 22+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2011-09-30  6:34 [Bug localedata/13237] New: country_name field of LC_ADDRESS cjlhomeaddress at gmail dot com
2011-10-11  8:04 ` [Bug localedata/13237] " claude at 2xlibre dot net
2011-12-22 16:55 ` drepper.fsp at gmail dot com
2011-12-22 17:07 ` cjlhomeaddress at gmail dot com
2011-12-23  8:31 ` claude at 2xlibre dot net
2011-12-23 14:54 ` drepper.fsp at gmail dot com
2011-12-23 15:00 ` cjlhomeaddress at gmail dot com
2012-04-06  2:48 ` cjlhomeaddress at gmail dot com
2012-04-06  3:43 ` cjlhomeaddress at gmail dot com
2012-04-06  4:24 ` cjlhomeaddress at gmail dot com
2012-04-06  7:22 ` cjlhomeaddress at gmail dot com
2012-04-06  7:24 ` cjlhomeaddress at gmail dot com
2012-04-06  7:25 ` cjlhomeaddress at gmail dot com
2012-04-06 20:36 ` bugtrack at roumenpetrov dot info
2012-04-06 20:37 ` bugtrack at roumenpetrov dot info
2012-04-06 20:38 ` bugtrack at roumenpetrov dot info
2012-04-06 20:43 ` bugtrack at roumenpetrov dot info
2012-04-07  1:34 ` cjlhomeaddress at gmail dot com
2012-11-15  8:23 ` aj at suse dot de
2012-11-15 18:39 ` aj at suse dot de
2013-01-03  9:04 ` aj at suse dot de
2014-06-27 12:00 ` fweimer at redhat dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).