public inbox for glibc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug localedata/15537] New: Invalid collation for Latvian diacritical letters
@ 2013-05-26 14:22 alex at gorka dot lv
  2014-06-13 17:36 ` [Bug localedata/15537] " fweimer at redhat dot com
  0 siblings, 1 reply; 2+ messages in thread
From: alex at gorka dot lv @ 2013-05-26 14:22 UTC (permalink / raw)
  To: glibc-bugs

http://sourceware.org/bugzilla/show_bug.cgi?id=15537

            Bug ID: 15537
           Summary: Invalid collation for Latvian diacritical letters
           Product: glibc
           Version: 2.18
            Status: NEW
          Severity: normal
          Priority: P2
         Component: localedata
          Assignee: unassigned at sourceware dot org
          Reporter: alex at gorka dot lv
                CC: libc-locales at sourceware dot org

Latvian language locale for Latvia has wrong collation order for Latvian
vowels: A MACRON (U0100, U0101), E MACRON (U0112, U0113), I MACRON (U012A,
U012B), O MACRON (U014C, U014D), and U MACRON (U016A, U016B).  The first weight
specifier for these letters should be equal to base letter (A, E, I, O, and U,
respectively), and only the second weight specifier must be heavier.  In other
words, letters with macrons are sorted after the same letters without macrons
only when string parts after the letter are equal.

Note that diacritical consonants - C CARON, G CEDILLA, K CEDILLA, L CEDILLA, N
CEDILLA, S CARON, and Z CARON - are always sorted after their base letters; for
these letters the first weight specifier must be different, and that is correct
with current version of the Latvian locale.

Besides, current version of Latvian locale contains letter R WITH CEDILLA
(U0156, U0157), which is now sorted separately from letter R with other
diacritical marks.  This letter is not currently used for Latvian writing in
Latvia (it was used in the first half of the 20th century, and is still used by
some Latvian communities outside Latvia), so the sorting rules for this letter
are not obvious.  I think that it would be better to make the first weight for
letter R WITH CEDILLA equal to R because most of current Latvian language users
cannot say when to use R with cedilla instead of R.

Finally, current version of Latvian locale sorts capital letters before small
letters, and that is not consistent with ISO14651 rules used by many glibc
locales; some users complain about that too.

-- 
You are receiving this mail because:
You are on the CC list for the bug.


^ permalink raw reply	[flat|nested] 2+ messages in thread

* [Bug localedata/15537] Invalid collation for Latvian diacritical letters
  2013-05-26 14:22 [Bug localedata/15537] New: Invalid collation for Latvian diacritical letters alex at gorka dot lv
@ 2014-06-13 17:36 ` fweimer at redhat dot com
  0 siblings, 0 replies; 2+ messages in thread
From: fweimer at redhat dot com @ 2014-06-13 17:36 UTC (permalink / raw)
  To: glibc-bugs

https://sourceware.org/bugzilla/show_bug.cgi?id=15537

Florian Weimer <fweimer at redhat dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
              Flags|                            |security-

-- 
You are receiving this mail because:
You are on the CC list for the bug.


^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2014-06-13 17:36 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-05-26 14:22 [Bug localedata/15537] New: Invalid collation for Latvian diacritical letters alex at gorka dot lv
2014-06-13 17:36 ` [Bug localedata/15537] " fweimer at redhat dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).