From: "maiku.fabian at gmail dot com"
To: libc-locales@sourceware.org
Subject: [Bug localedata/10502] sorting between Indic Languages should be as per unicode code point
Date: Fri, 05 Jan 2024 11:38:31 +0000
X-Bugzilla-Product: glibc
X-Bugzilla-Component: localedata
X-Bugzilla-Version: 2.10
X-Bugzilla-Severity: normal
X-Bugzilla-Status: NEW
X-Bugzilla-Priority: P2

https://sourceware.org/bugzilla/show_bug.cgi?id=10502

Mike FABIAN changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |maiku.fabian at gmail dot com

--- Comment #3 from Mike FABIAN ---
In 2018, we updated the iso14651_t1_common to a 2016 version and then adapted
the sort order of many locales.
So the sort order of these Indic languages should now be in sync with the DUCET
(http://www.unicode.org/Public/UCA/latest/allkeys.txt) as approximately defined
in 2016.

So I think the problem in the original comment is fixed.

commit 9479b6d5e08eacce06c6ab60abc9b2f4eb8b71e4
Author: Mike FABIAN
Date:   Tue Jan 30 17:59:00 2018 +0100

    Update iso14651_t1_common file to ISO14651_2016_TABLE1_en.txt [BZ #14095]

    [BZ #14095] - Review / update collation data from Unicode / ISO 14651

    File downloaded from:
    http://standards.iso.org/iso-iec/14651/ed-4/ISO14651_2016_TABLE1_en.txt

    Updating this file alone is not enough, there are problems in the new
    file which need to be fixed and the collation rules for many locales
    need to be adapted.  This is done by the following patches.

    This update also fixes the problem that many characters are treated as
    identical when sorting because they were not yet in the old
    iso14651_t1_common file, see:

    https://bugzilla.redhat.com/show_bug.cgi?id=1336308
    - Infinite (∞) and empty set (∅) are treated as if they were the
      same character by sort and uniq

            [BZ #14095]
            * localedata/locales/iso14651_t1_common: Update file to
            latest version from ISO (ISO14651_2016_TABLE1_en.txt).
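For anyone who wants to check the behaviour described above on their own system, here is a minimal sketch in Python. It uses the standard `locale` module, which calls the C library's collation routines, so on a glibc system the result should reflect the installed locale data. The helper names `collates_equal` and `use_environment_locale` are made up for this illustration, and the sample characters (KA in Devanagari, Bengali and Gurmukhi) are only an assumed test set, not taken from the bug report. What it prints depends on the glibc version and the active locale, so no expected output is given.

```python
import functools
import locale


def collates_equal(a: str, b: str) -> bool:
    """Return True if the active LC_COLLATE locale treats a and b as identical."""
    return locale.strcoll(a, b) == 0


def use_environment_locale() -> bool:
    """Switch LC_COLLATE to the locale named by the environment.

    Returns False (and leaves the current locale in place) if that
    locale is not installed.
    """
    try:
        locale.setlocale(locale.LC_COLLATE, "")
        return True
    except locale.Error:
        return False


if __name__ == "__main__":
    if not use_environment_locale():
        print("environment locale not available, keeping the current one")

    # U+221E INFINITY and U+2205 EMPTY SET: the pair named in the
    # Red Hat report quoted in the commit message.
    print("U+221E and U+2205 collate equal:",
          collates_equal("\u221e", "\u2205"))

    # Assumed sample: the letter KA in Devanagari, Bengali and Gurmukhi,
    # listed here in reverse code point order.
    sample = ["\u0a15", "\u0995", "\u0915"]
    by_locale = sorted(sample, key=functools.cmp_to_key(locale.strcoll))
    by_code_point = sorted(sample)
    print("locale order:    ", [f"U+{ord(c):04X}" for c in by_locale])
    print("code point order:", [f"U+{ord(c):04X}" for c in by_code_point])
```

Running it under a UTF-8 locale such as the one set in `LANG` before and after the iso14651_t1_common update would be one way to compare the two behaviours. Under the plain C locale, comparison is expected to fall back to code point order.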