From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 32300 invoked by alias); 23 Dec 2014 04:25:38 -0000 Mailing-List: contact glibc-bugs-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: glibc-bugs-owner@sourceware.org Received: (qmail 32214 invoked by uid 48); 23 Dec 2014 04:25:30 -0000 From: "aoliva at sourceware dot org" To: glibc-bugs@sourceware.org Subject: [Bug localedata/17750] New: wrong collation order of diacritics in most locales Date: Tue, 23 Dec 2014 04:25:00 -0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: new X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: glibc X-Bugzilla-Component: localedata X-Bugzilla-Version: unspecified X-Bugzilla-Keywords: X-Bugzilla-Severity: normal X-Bugzilla-Who: aoliva at sourceware dot org X-Bugzilla-Status: NEW X-Bugzilla-Priority: P2 X-Bugzilla-Assigned-To: unassigned at sourceware dot org X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: bug_id short_desc product version bug_status bug_severity priority component assigned_to reporter cc Message-ID: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://sourceware.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-SW-Source: 2014-12/txt/msg00191.txt.bz2 https://sourceware.org/bugzilla/show_bug.cgi?id=3D17750 Bug ID: 17750 Summary: wrong collation order of diacritics in most locales Product: glibc Version: unspecified Status: NEW Severity: normal Priority: P2 Component: localedata Assignee: unassigned at sourceware dot org Reporter: aoliva at sourceware dot org CC: libc-locales at sourceware dot org http://www.unicode.org/reports/tr10/tr10-30.html states: Normally, all differences in sorting are assessed from the start to the end= of the string. If all of the base letters are the same, the first accent difference determines the final order. In row 1 of Table 5, the first accent difference is on the o, so that is what determines the order. In some French dictionary ordering traditions, however, it is the last accent difference t= hat determines the order, as shown in row 2. Table 5 says:
Normal Accent Ordering      cote < cot=C3=A9 < c=C3=B4te < c=C3=B4t=C3=A9
Backward Accent Ordering     cote < c=C3=B4te < cot=C3=A9 < c=C3=B4t=C3=A9
However, glibc implements backward accent ordering for all locales except d= e_DE and lb_LU.=20=20 Unicode CLDR 26 confirms this is wrong: the only file in http://unicode.org/cldr/trac/browser/tags/release-26/common/collation/ that= has settings backwards=3D"on" is fr_CA.xml. --=20 You are receiving this mail because: You are on the CC list for the bug. >>From glibc-bugs-return-26949-listarch-glibc-bugs=sources.redhat.com@sourceware.org Tue Dec 23 04:30:15 2014 Return-Path: Delivered-To: listarch-glibc-bugs@sources.redhat.com Received: (qmail 1839 invoked by alias); 23 Dec 2014 04:30:14 -0000 Mailing-List: contact glibc-bugs-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: glibc-bugs-owner@sourceware.org Delivered-To: mailing list glibc-bugs@sourceware.org Received: (qmail 1764 invoked by uid 48); 23 Dec 2014 04:30:08 -0000 From: "aoliva at sourceware dot org" To: glibc-bugs@sourceware.org Subject: [Bug localedata/17750] wrong collation order of diacritics in most locales Date: Tue, 23 Dec 2014 04:30:00 -0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: glibc X-Bugzilla-Component: localedata X-Bugzilla-Version: unspecified X-Bugzilla-Keywords: X-Bugzilla-Severity: normal X-Bugzilla-Who: aoliva at sourceware dot org X-Bugzilla-Status: NEW X-Bugzilla-Priority: P2 X-Bugzilla-Assigned-To: aoliva at sourceware dot org X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: assigned_to Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 7bit X-Bugzilla-URL: http://sourceware.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-SW-Source: 2014-12/txt/msg00192.txt.bz2 Content-length: 566 https://sourceware.org/bugzilla/show_bug.cgi?id=17750 Alexandre Oliva changed: What |Removed |Added ---------------------------------------------------------------------------- Assignee|unassigned at sourceware dot org |aoliva at sourceware dot org --- Comment #1 from Alexandre Oliva --- Mine. I posted a patch at https://sourceware.org/ml/libc-alpha/2014-12/msg00524.html -- You are receiving this mail because: You are on the CC list for the bug.