From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (qmail 31034 invoked by alias); 17 Aug 2009 12:18:52 -0000
Received: (qmail 30982 invoked by uid 48); 17 Aug 2009 12:18:36 -0000
Date: Mon, 17 Aug 2009 12:18:00 -0000
Message-ID: <20090817121836.30981.qmail@sourceware.org>
From: "sayamindu at gmail dot com"
To: libc-locales@sources.redhat.com
In-Reply-To: <20090809051246.10501.santhosh.thottingal@gmail.com>
References: <20090809051246.10501.santhosh.thottingal@gmail.com>
Reply-To: sourceware-bugzilla@sourceware.org
Subject: [Bug localedata/10501] bn_IN collation does not have canonical equivalence definitions
X-Bugzilla-Reason: AssignedTo
Mailing-List: contact libc-locales-help@sourceware.org; run by ezmlm
Precedence: bulk
List-Id:
List-Subscribe:
List-Post:
List-Help: ,
Sender: libc-locales-owner@sourceware.org
X-SW-Source: 2009-q3/txt/msg00035.txt.bz2

------- Additional Comments From sayamindu at gmail dot com 2009-08-17 12:18 -------
(In reply to comment #2)
> Refer to the collation rules of UCA -
> http://www.unicode.org/Public/UCA/latest/allkeys.txt
> [...]
> 09CB ; [.1B48.0020.0002.09CB] # BENGALI VOWEL SIGN O
> 09C7 09BE ; [.1B48.0020.0002.09CB] # BENGALI VOWEL SIGN O
> 09CC ; [.1B49.0020.0002.09CC] # BENGALI VOWEL SIGN AU
> 09C7 09D7 ; [.1B49.0020.0002.09CC] # BENGALI VOWEL SIGN AU
> [...]
>
> It is implemented in UCA and should be available in glibc localedata too, i.e.,
> collation weights of canonically equivalent sequences should be explicitly
> defined in glibc, and there should not be any assumption about the input to the
> collation.
>

I would tend to second Santhosh here, since we do not know where the data
might come from (e.g., someone might take a shortcut while implementing a
legacy encoding -> Unicode converter).

--

http://sourceware.org/bugzilla/show_bug.cgi?id=10501
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
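
For illustration only, here is a minimal Python sketch of the canonical
equivalence discussed in the comment above. It uses the standard library's
unicodedata module, not glibc, and says nothing about how the bn_IN locale
actually assigns weights. The helper name canonically_equivalent and the sample
strings are assumptions made for this example. It shows that the precomposed
vowel signs and their two-code-point sequences differ code point by code point
but are the same text under Unicode normalization, which is why un-normalized
input can reach a collation routine.

```python
import unicodedata

# Precomposed Bengali vowel signs and the two-code-point sequences that the
# quoted allkeys.txt lines give the same collation element.
PAIRS = {
    "\u09CB": "\u09C7\u09BE",  # BENGALI VOWEL SIGN O
    "\u09CC": "\u09C7\u09D7",  # BENGALI VOWEL SIGN AU
}


def canonically_equivalent(a: str, b: str) -> bool:
    """Hypothetical helper: two strings are canonically equivalent
    when their NFD (canonical decomposition) forms are identical."""
    return unicodedata.normalize("NFD", a) == unicodedata.normalize("NFD", b)


for composed, decomposed in PAIRS.items():
    # The raw strings differ, so a naive code point comparison
    # treats them as different text.
    assert composed != decomposed
    # After normalization they are the same text.
    assert canonically_equivalent(composed, decomposed)
    # NFC recomposes the sequence into the single code point.
    assert unicodedata.normalize("NFC", decomposed) == composed

# The same holds inside a larger string, e.g. BENGALI LETTER KA (U+0995)
# followed by either spelling of VOWEL SIGN O.
word_composed = "\u0995\u09CB"
word_decomposed = "\u0995\u09C7\u09BE"
print(word_composed == word_decomposed)
print(canonically_equivalent(word_composed, word_decomposed))
```

A converter that emits the decomposed spelling produces strings of the second
kind, so a collation table that only lists 09CB and 09CC would not see them as
the same vowel signs unless the input is normalized first.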