public inbox for glibc-bugs@sourceware.org
help / color / mirror / Atom feed
* [Bug localedata/368] New: localedef fails with coplex LC_COLLATE rules
@ 2004-09-05 20:49 pablo at mandrakesoft dot com
  2004-09-05 20:50 ` [Bug localedata/368] " pablo at mandrakesoft dot com
                   ` (9 more replies)
  0 siblings, 10 replies; 11+ messages in thread
From: pablo at mandrakesoft dot com @ 2004-09-05 20:49 UTC (permalink / raw)
  To: glibc-bugs

I reached what seems to be a limitation in the numlber of LC_COLLATE collating-elements.

I was trying to build a dz_BT locale (Dzongkha language, Buthan);
the sorting rules are quite special, as for example nexy to <ka> entry are words starting with prefix attached to ka radical, eg: <da>-<ka>, <ba>-<ka> etc, come just after words starting with <ka>, and not with words starting with <da>, <ba>, etc.
Said otherwise, the base collating elements are the 30 base letters, plus 103 prefix-radical collating elements.
Now, it is even more complex that that; some letter sequences are prefix-radical or not depending on what follows them; eg for <da>-<ga> it is a prefix if followed with <ga>, <nga>, <da>,... but not otherwise.
That is, it is needed to define collating elements comprising of the prefix element and the next char, which are then sorted as a digraph; eg:
collating-element <rad-ga-d-ga> from "<U0F51><U0F42><U0F42>"
...
<rad-ga-d-ga>  "<TIB-GA-R_D><TIB-GA>";....

That mens there are a lot of collating-elements to define; 303 in total.
But ifI use more than 265 the locale doesn't compile (localedef just runs forever taking 90% of cpu ressources doing nothing); while if I comment some of them to have no more than 265 in use, then it compiles nicely.

I attach the preliminary dz_BT locale I was working on; some entries are commented with %%%% (four percent signs), so that the file can compile; but to have the rules complete, all those lines commented out with "%%%%" should be enabled as well.

-- 
           Summary: localedef fails with coplex LC_COLLATE rules
           Product: glibc
           Version: unspecified
            Status: NEW
          Severity: normal
          Priority: P2
         Component: localedata
        AssignedTo: pere at hungry dot com
        ReportedBy: pablo at mandrakesoft dot com
                CC: glibc-bugs at sources dot redhat dot com


http://sources.redhat.com/bugzilla/show_bug.cgi?id=368

------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.


^ permalink raw reply	[flat|nested] 11+ messages in thread

end of thread, other threads:[~2005-10-14 22:57 UTC | newest]

Thread overview: 11+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2004-09-05 20:49 [Bug localedata/368] New: localedef fails with coplex LC_COLLATE rules pablo at mandrakesoft dot com
2004-09-05 20:50 ` [Bug localedata/368] " pablo at mandrakesoft dot com
2005-01-02 23:26 ` barbier at linuxfr dot org
2005-01-17 21:38 ` barbier at linuxfr dot org
2005-01-17 22:33 ` barbier at linuxfr dot org
2005-01-17 22:33 ` barbier at linuxfr dot org
2005-08-03 10:54 ` cfynn at gmx dot net
2005-08-03 11:37 ` cfynn at gmx dot net
2005-10-14 21:11 ` drepper at redhat dot com
2005-10-14 22:56 ` drepper at redhat dot com
2005-10-14 22:57 ` drepper at redhat dot com

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).