public inbox for glibc-bugs@sourceware.org help / color / mirror / Atom feed
From: "alex at gorka dot lv" <sourceware-bugzilla@sourceware.org> To: glibc-bugs@sourceware.org Subject: [Bug localedata/15537] New: Invalid collation for Latvian diacritical letters Date: Sun, 26 May 2013 14:22:00 -0000 [thread overview] Message-ID: <bug-15537-131@http.sourceware.org/bugzilla/> (raw) http://sourceware.org/bugzilla/show_bug.cgi?id=15537 Bug ID: 15537 Summary: Invalid collation for Latvian diacritical letters Product: glibc Version: 2.18 Status: NEW Severity: normal Priority: P2 Component: localedata Assignee: unassigned at sourceware dot org Reporter: alex at gorka dot lv CC: libc-locales at sourceware dot org Latvian language locale for Latvia has wrong collation order for Latvian vowels: A MACRON (U0100, U0101), E MACRON (U0112, U0113), I MACRON (U012A, U012B), O MACRON (U014C, U014D), and U MACRON (U016A, U016B). The first weight specifier for these letters should be equal to base letter (A, E, I, O, and U, respectively), and only the second weight specifier must be heavier. In other words, letters with macrons are sorted after the same letters without macrons only when string parts after the letter are equal. Note that diacritical consonants - C CARON, G CEDILLA, K CEDILLA, L CEDILLA, N CEDILLA, S CARON, and Z CARON - are always sorted after their base letters; for these letters the first weight specifier must be different, and that is correct with current version of the Latvian locale. Besides, current version of Latvian locale contains letter R WITH CEDILLA (U0156, U0157), which is now sorted separately from letter R with other diacritical marks. This letter is not currently used for Latvian writing in Latvia (it was used in the first half of the 20th century, and is still used by some Latvian communities outside Latvia), so the sorting rules for this letter are not obvious. I think that it would be better to make the first weight for letter R WITH CEDILLA equal to R because most of current Latvian language users cannot say when to use R with cedilla instead of R. Finally, current version of Latvian locale sorts capital letters before small letters, and that is not consistent with ISO14651 rules used by many glibc locales; some users complain about that too. -- You are receiving this mail because: You are on the CC list for the bug.
next reply other threads:[~2013-05-26 14:22 UTC|newest] Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top 2013-05-26 14:22 alex at gorka dot lv [this message] 2014-06-13 17:36 ` [Bug localedata/15537] " fweimer at redhat dot com
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=bug-15537-131@http.sourceware.org/bugzilla/ \ --to=sourceware-bugzilla@sourceware.org \ --cc=glibc-bugs@sourceware.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).