From: "digitalfreak at lingonborough dot com"
To: libc-locales@sourceware.org
Subject: [Bug localedata/23140] More languages need two forms of month names
Date: Mon, 18 Jun 2018 10:04:00 -0000
X-Bugzilla-Product: glibc
X-Bugzilla-Component: localedata
X-Bugzilla-Version: 2.27
X-Bugzilla-Status: NEW
X-Bugzilla-Severity: normal
X-Bugzilla-Priority: P2
X-Bugzilla-Assigned-To: unassigned at sourceware dot org

https://sourceware.org/bugzilla/show_bug.cgi?id=23140

--- Comment #34 from Rafal Luzynski ---
I can't even read Armenian script, but I'm trying to figure out what's going
on here. The Armenian alphabet seems to have uppercase and lowercase letters,
the same as Latin, Cyrillic, and Greek. Glibc currently seems to provide the
month names starting with uppercase. It is easy to convert between uppercase
and lowercase: each uppercase code point is 0x30 less than the corresponding
lowercase one.
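The 0x30 offset can be checked quickly. This is only a minimal Python sketch: the helper name `armenian_lower` is made up for illustration and is not part of glibc or CLDR, and the ranges U+0531..U+0556 (capital) and U+0561..U+0586 (small) are taken from the Unicode Armenian block.

```python
# Armenian capital letters occupy U+0531..U+0556 and the matching
# small letters occupy U+0561..U+0586, i.e. 0x30 higher.
ARMENIAN_CASE_OFFSET = 0x30


def armenian_lower(ch: str) -> str:
    """Lowercase one Armenian capital letter by adding the fixed offset.

    Hypothetical helper for illustration; any other character is
    returned unchanged.
    """
    cp = ord(ch)
    if 0x0531 <= cp <= 0x0556:
        return chr(cp + ARMENIAN_CASE_OFFSET)
    return ch


# Compare the offset rule with Python's own Unicode case mapping.
mismatches = [
    hex(cp)
    for cp in range(0x0531, 0x0557)
    if armenian_lower(chr(cp)) != chr(cp).lower()
]
print(mismatches)
```

An empty list means the fixed offset agrees with the Unicode case mapping for every Armenian capital letter.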

For example, the current name for March:

"" Decoded: "Մարտի" Should be transliterated as: "Marti"

My patch, which is an import from CLDR, converts it to:

alt_mon: "" => "մարտ" => "mart"
mon: "" => "մարտի" => "marti"

So it looks like the locale data currently contains the month names in the
genitive case (or whatever is correct to use in a full date) starting with
uppercase, while CLDR uses lowercase exclusively. Please note that we already
had a similar case in the Lithuanian language.

But the Armenian language seems to use the "month day" order, the same as
English, so the month name may tend to appear at the beginning of a sentence,
and it may be reasonable to start it with an uppercase letter even if there is
no rule in the language which says that month names should always be
titlecased. If the mon array is titlecased then alt_mon should be as well
(CLDR suggests that alt_mon may be titlecased and mon lowercased, but not the
reverse), so maybe we should use titlecase for alt_mon too.
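
To illustrate the titlecasing step, here is a rough Python sketch. It assumes the locale source spells strings as `<UXXXX>` escapes (which is why the patch has to be "decoded" to be readable); the function names `titlecase_first` and `to_locale_escapes` are hypothetical and not part of any glibc tooling.

```python
def titlecase_first(name: str) -> str:
    """Uppercase only the first letter and leave the rest untouched."""
    return name[:1].upper() + name[1:]


def to_locale_escapes(text: str) -> str:
    """Spell a string as <UXXXX> escapes (assumed locale source notation)."""
    return "".join(f"<U{ord(ch):04X}>" for ch in text)


# CLDR's lowercase nominative form of March ("mart"), given as code points.
alt_mon_cldr = "\u0574\u0561\u0580\u057f"

titled = titlecase_first(alt_mon_cldr)
print(titled)
print(to_locale_escapes(titled))  # <U0544><U0561><U0580><U057F>
```

Only the first letter changes (U+0574 becomes U+0544), which matches the 0x30 offset between Armenian lowercase and uppercase.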

Converting this to titlecase, the decoded patch for hy_AM would look like
this:

diff --git a/localedata/locales/hy_AM b/localedata/locales/hy_AM
index 805c327..30033a9 100644
--- a/localedata/locales/hy_AM
+++ b/localedata/locales/hy_AM
@@ -130,6 +130,18 @@ abmon "Հնվ";/
         "Հոկ";/
         "Նմբ";/
         "Դեկ"
+alt_mon "Հունվար";/
+        "Փետրվար";/
+        "Մարտ";/
+        "Ապրիլ";/
+        "Մայիս";/
+        "Հունիս";/
+        "Հուլիս";/
+        "Օգոստոս";/
+        "Սեպտեմբեր";/
+        "Հոկտեմբեր";/
+        "Նոյեմբեր";/
+        "Դեկտեմբեր"
 mon     "Հունվարի";/
         "Փետրվարի";/
         "Մարտի";/

I'm still looking for people to confirm this.

Links:
http://st.unicode.org/cldr-apps/v#/hy/Gregorian/
https://www.fileformat.info/info/unicode/block/armenian/list.htm
https://en.wikipedia.org/wiki/Armenian_alphabet
https://hy.wikipedia.org/wiki/%D5%84%D5%A1%D6%80%D5%BF%D5%AB_1
(this is an example of an Armenian date)