From: "rei4dan at gmail dot com"
To: libc-locales@sourceware.org
Subject: [Bug localedata/23774] lv_LV collates Y/y incorrectly
Date: Sat, 22 Dec 2018 19:56:00 -0000
X-Bugzilla-Product: glibc
X-Bugzilla-Component: localedata
X-Bugzilla-Version: unspecified
X-Bugzilla-Severity: minor
X-Bugzilla-Status: UNCONFIRMED
X-Bugzilla-Priority: P2
X-Bugzilla-Assigned-To: unassigned at sourceware dot org
X-Bugzilla-Changed-Fields: cc

https://sourceware.org/bugzilla/show_bug.cgi?id=23774

Reinis Danne changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |rei4dan at gmail dot com

--- Comment #3 from Reinis Danne ---
sed-4.6 and grep-3.3 seem to have resolved this particular issue by
implementing rational range interpretation, but [a-ž] and [A-Ž] are buggy.
The former de-interleaves the capital letters for unaccented characters, but
accented capitals are left among the small letters. Does glibc (2.28) offer
alternative collations (or does grep do it)?
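
The range behaviour described above can be reproduced with a small sketch. It assumes that rational range interpretation compares characters by Unicode code point, which is a reading of the term and not something this comment states. The names in_codepoint_range, matched and SAMPLE are illustrative only and do not come from sed, grep or glibc.

```python
def in_codepoint_range(ch, lo, hi):
    """True if ch lies in [lo-hi] when the range is read by code point.

    Assumption: this models rational range interpretation; it is not
    the actual sed/grep implementation.
    """
    return ord(lo) <= ord(ch) <= ord(hi)


def matched(chars, lo, hi):
    """Return the characters of chars that fall inside [lo-hi]."""
    return [c for c in chars if in_codepoint_range(c, lo, hi)]


# Unaccented and accented Latvian letters, small and capital.
SAMPLE = "aAyYāĀīĪžŽ"

if __name__ == "__main__":
    print("[a-ž] matches:", "".join(matched(SAMPLE, "a", "ž")))
    print("[A-Ž] matches:", "".join(matched(SAMPLE, "A", "Ž")))
```

Under that assumption, [a-ž] drops A and Y (U+0041, U+0059) but keeps Ā, Ī and Ž, because the accented letters alternate capital and small in the code chart (Ā is U+0100, ā is U+0101). That would be consistent with accented capitals appearing among the small letters.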

As far as I could tell, the collation sequence is as specified in the locale:

Using LC_COLLATE=lv_LV.UTF-8
char  strxfrm
i     c2b7010201020101e29b96
I     c2b7010201070101e2afb7
ī     c2b70102140102020101e29bb7
Ī     c2b70102140107020101e2b096
y     c2b701030102
Y     c2b701030107
j     c382010201020101e29c96
J     c382010201070101e2b0a4

Using LC_COLLATE=C.UTF-8
char  strxfrm
i     6b
I     4b
ī     c4ad
Ī     c4ac
y     7b
Y     5b
j     6c
J     4c
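
A rough way to check the resulting order without reading the raw keys is to sort the same letters with the locale's collation key. This is a minimal sketch: collation_order is a hypothetical helper, and the fallback to the "C" locale when the requested locale is not installed is an assumption made so the script still runs. It uses Python's locale.strxfrm, which returns a string key and not the hex bytes listed above.

```python
import locale


def collation_order(chars, locale_name):
    """Return chars sorted by the LC_COLLATE rules of locale_name.

    Falls back to the "C" locale if locale_name is not installed
    (an assumption for portability, not part of the bug report).
    """
    saved = locale.setlocale(locale.LC_COLLATE)  # query current setting
    try:
        try:
            locale.setlocale(locale.LC_COLLATE, locale_name)
        except locale.Error:
            locale.setlocale(locale.LC_COLLATE, "C")
        return sorted(chars, key=locale.strxfrm)
    finally:
        # Restore whatever collation locale was active before the call.
        locale.setlocale(locale.LC_COLLATE, saved)


if __name__ == "__main__":
    letters = ["i", "I", "ī", "Ī", "y", "Y", "j", "J"]
    for name in ("lv_LV.UTF-8", "C.UTF-8"):
        print(name, " ".join(collation_order(letters, name)))
```

Compared bytewise, the lv_LV.UTF-8 keys quoted above already sort in the order listed (i, I, ī, Ī, y, Y, j, J), so y and Y land between the i and j groups, which is the behaviour this bug is about.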