From: Mike Frysinger <vapier@gentoo.org>
To: Chris Leonard <cjlhomeaddress@gmail.com>
Cc: Marko Myllynen <myllynen@redhat.com>,
libc-alpha <libc-alpha@sourceware.org>,
Carlos O'Donell <carlos@redhat.com>,
Florian Weimer <fweimer@redhat.com>
Subject: Re: [PATCH 1/3] localedata: use same comment_char/escape_char in these files
Date: Mon, 11 Apr 2016 18:58:00 -0000 [thread overview]
Message-ID: <20160411185850.GL6588@vapier.lan> (raw)
In-Reply-To: <CAHdAata8SwOO4a1QFdG3m1L6238j3eX8zscYqkPtVW6dWiLz=A@mail.gmail.com>
[-- Attachment #1: Type: text/plain, Size: 2933 bytes --]
On 13 Mar 2016 13:49, Chris Leonard wrote:
> One fundamental way of looking at this issue is to distill it down to
> just unique language codes (ignoring countries, scripts and currency
> variants).
right -- i don't care nearly as much about the locale combos (lang +
territory). they do provide unambiguous direction at that point, but
with a little effort, we can still get good data w/out them.
some of the data is territory-specific and lang-independent, so as long
as cldr has details about all the territories glibc uses (and it does
today), then that part is fine. i don't think we should add any that
are not listed in the cldr since it looks pretty complete (as it pulls
in a number of other standards).
> Between both projects combined, there are 257 unique language codes*.
>
> 115 codes are common between both projects.
> 79 languages are represented in CLDR only
> 63 languages are represented in glibc only.
as long as cldr has at least a territory-independent lang entry,
we can extract a good amount of detail out of that.
my concerns start when cldr lacks any lang info at all, or even more
problematic, has marked that lang code as deprecated or uses a diff
naming convention. there appears to be about 65 langs / 71 locales
on the glibc side (ignoring @script variants) that fall into these
buckets. looks like about the same count as you have.
> *Note: one exception is the qu/quz conflict between selecting language
> codes to represent the Quechua languages of the Andes. I counted this
> as in common, although it will require some resolution going forward,
> as will the Aymara ay/ayc choice (there is no existing CLDR locale for
> Aymara at present).
>
> There are very possibly a few other such code selection issues which I
> will look into further, I have a nagging suspicion that something is
> going on with the Sotho language codes of Africa, but I need to
> confirm that. In any event, those wouldn't change the overall numbers
> much.
>
> Overall, I would not declare one project the winner over the other in
> terms of best representing languages, clearly some cross-porting
> should be done where possible for the sake of language communities
> dependent on either locale type.
>
> I'm here because I work with glibc-dependent language communities, so
> that has been my focus. I have not tried to work with CLDR on
> locales. Anyone here have experience with that? How
> welcoming/responsive are they to people who are trying to act as
> intermediaries for minority language communities?
seeing as you can represent the concerns of these communities better
than probably any of us, it would be great if you could look into the
cldr process. from my glances around there, it doesn't look *too*
hard to break in and start posting contributions, especially when you
have no one else representing those languages.
-mike
[-- Attachment #2: Digital signature --]
[-- Type: application/pgp-signature, Size: 819 bytes --]
next prev parent reply other threads:[~2016-04-11 18:58 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-02-20 7:54 Mike Frysinger
2016-02-20 7:54 ` [PATCH 2/3] localedata: standardize first few lines Mike Frysinger
2016-02-20 7:55 ` [PATCH 3/3] localedata: standardize copyright/license information [BZ #11213] Mike Frysinger
2016-02-25 20:13 ` [PATCH 1/3] localedata: use same comment_char/escape_char in these files Mike Frysinger
2016-03-09 22:24 ` Mike Frysinger
2016-03-11 10:31 ` Marko Myllynen
2016-03-11 14:24 ` Chris Leonard
2016-03-12 12:19 ` Marko Myllynen
2016-03-12 19:10 ` Chris Leonard
2016-03-13 17:50 ` Chris Leonard
2016-03-14 18:06 ` Carlos O'Donell
2016-04-11 18:58 ` Mike Frysinger [this message]
2016-04-11 20:46 ` Chris Leonard
2016-04-12 2:11 ` Mike Frysinger
2016-04-12 2:50 ` Chris Leonard
2016-03-14 18:20 ` Carlos O'Donell
2016-03-14 19:43 ` keld
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20160411185850.GL6588@vapier.lan \
--to=vapier@gentoo.org \
--cc=carlos@redhat.com \
--cc=cjlhomeaddress@gmail.com \
--cc=fweimer@redhat.com \
--cc=libc-alpha@sourceware.org \
--cc=myllynen@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).