From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (qmail 24728 invoked by alias); 14 Oct 2008 08:06:47 -0000
Received: (qmail 24238 invoked by uid 22791); 12 Oct 2008 15:53:24 -0000
X-Spam-Status: No, hits=0.7 required=5.0 tests=BAYES_50,SPF_NEUTRAL
X-Spam-Check-By: sourceware.org
X-TPG-Antivirus: Passed
Subject: QUESTION: LC_COLLATE minimal requirements?
From: Harshula
To: libc-locales@sources.redhat.com
Cc: Pravin S
Content-Type: text/plain
Date: Tue, 14 Oct 2008 08:06:00 -0000
Message-Id: <1223826729.4898.72.camel@B1.HOME>
Mime-Version: 1.0
X-Mailer: Evolution 2.22.3.1
Content-Transfer-Encoding: 7bit
Mailing-List: contact libc-locales-help@sourceware.org; run by ezmlm
Precedence: bulk
List-Id:
List-Subscribe:
List-Post:
List-Help: ,
Sender: libc-locales-owner@sourceware.org
X-SW-Source: 2008-q4/txt/msg00003.txt.bz2

Hi,

I was unable to find much documentation on LC_COLLATE except for [1].
Hence I have a few questions.

Firstly, some background information. The Sinhala collation sequence
(SLS1134) is relatively simple.

* It does not have multiple characters mapping to a single collation
  element.
* It does not consider composed and decomposed dependent vowels as
  equivalent [2].
* It does not have to deal with secondary and tertiary weights.
* It has a few simple tailoring rules [3] that need to be applied to
  the DUCET [4].

Q1) Is it a requirement to use the collating-symbol keyword to define
ALL symbols? If not, is this patch sufficient and acceptable for glibc?

http://cvs.savannah.gnu.org/viewvc/sinhala/patches/iso14651_t1_common-glibc.patch?root=sinhala&view=log

Q2) Instead of explicitly listing all the characters in order, is it
possible to use the reorder-after keyword to only define variations to
the DUCET?

Q3) I couldn't find any documentation on:

translit_start
include "translit_combining";""
translit_end

/usr/share/i18n/locales/translit_combining
------------------------------------------
% SINHALA VOWEL SIGN DIGA KOMBUVA
""
% SINHALA VOWEL SIGN KOMBUVA HAA AELA-PILLA
""
% SINHALA VOWEL SIGN KOMBUVA HAA DIGA AELA-PILLA
""
% SINHALA VOWEL SIGN KOMBUVA HAA GAYANUKITTA
""
------------------------------------------

Does translit_start have an effect on LC_COLLATE?

Thanks,
#

[1] http://www.opengroup.org/onlinepubs/009695399/basedefs/xbd_chap07.html
[2] http://sourceforge.net/mailarchive/forum.php?thread_name=1223803982.4898.16.camel%40B1.HOME&forum_name=sinhala-technical
[3] http://www.nongnu.org/sinhala/doc/howto/sinhala-howto.html#DEV-DATABASES
[4] http://unicode.org/Public/UCA/latest/allkeys.txt
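
P.S. To make the intent of Q2 concrete, here is a rough Python sketch of
the behaviour being asked about: start from a base order and describe
only the variations as "move these symbols after that one". This is not
glibc locale syntax and does not contain the SLS1134 rules. The symbols
"a" to "e" and the helper names apply_reorder_after and collation_key
are made up for illustration, and only a single weight level is
modelled, matching the background above.

```python
def apply_reorder_after(base_order, rules):
    """Return a copy of base_order with each tailoring rule applied.

    Each rule is (anchor, moved): the symbols in `moved` are taken out of
    the order and re-inserted immediately after `anchor`, keeping the
    relative order given in `moved`.
    """
    order = list(base_order)
    for anchor, moved in rules:
        moved = list(moved)
        # Remove the moved symbols from their current positions.
        order = [s for s in order if s not in moved]
        # Re-insert them directly after the anchor symbol.
        pos = order.index(anchor) + 1
        order[pos:pos] = moved
    return order


def collation_key(order):
    """Build a single-level sort key from a symbol order."""
    rank = {symbol: i for i, symbol in enumerate(order)}
    return lambda word: [rank[ch] for ch in word]


# Hypothetical base order standing in for the default table.
base = ["a", "b", "c", "d", "e"]

# One hypothetical tailoring rule: "d" sorts right after "a".
tailored = apply_reorder_after(base, [("a", ["d"])])

words = ["b", "d", "a", "ab", "ad"]
sorted_words = sorted(words, key=collation_key(tailored))
```

With this model, only the differences from the base order have to be
written down; every symbol that no rule mentions keeps its position
relative to the other unmoved symbols.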