From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.129.124]) by sourceware.org (Postfix) with ESMTPS id 9BA653858C83 for ; Tue, 19 Apr 2022 13:43:55 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 9BA653858C83 Received: from mail-wr1-f69.google.com (mail-wr1-f69.google.com [209.85.221.69]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-317-Srl4ChvZMZuDRxyHTyEYXg-1; Tue, 19 Apr 2022 09:43:53 -0400 X-MC-Unique: Srl4ChvZMZuDRxyHTyEYXg-1 Received: by mail-wr1-f69.google.com with SMTP id e2-20020adfa442000000b0020a91fa37b9so684254wra.10 for ; Tue, 19 Apr 2022 06:43:52 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:from:to:cc:subject:organization:references:date :in-reply-to:message-id:user-agent:mime-version :content-transfer-encoding; bh=GkzSgVkS+HxOUSXR4DiJE//eF9b+TsGvbwldzUxxxgQ=; b=D0y4n3O9AFzsIwBRG3Ikg7CphNQiiIcdIEZatfYKhZUNB15MMwBJgyiHu5YRmn0c5B B5rwyDOUxEKCSTiC9LnjCylSTtpIvlep1/mIHbcgGCWM27qmq6SD/wqlqwiQd8Wz8Ks5 hDz7EKEOHMQLqhoDrPpKfnSJkzvLIka0PpV6yig9MHfsHWUlAL9cDpPzTWa+QOIa5d07 RGDZwt32RkjUDnjBXxeEagNKhHcpmh6+CmnQOUnkDg0y8oTA5MkeRJoe/To4T2O+TJaa sZT+8k1YPW2Th+E1OWUFr4bRDsFqIfq7/bGcGjfV9eIGZcVx69jz25lcNXM3t++6FX2i EnIw== X-Gm-Message-State: AOAM532A0dOLsUMXWgPYpVNy73hLEvezW+9IHnEsUqt2maXZxjLo/mj5 P8FWaZBgqCbUIE9vMgc5zgDMLirP0ov9b2MiuyTWO4D0jHfqMc7BNBdMRXMgL1LqycWAVhmB51l KQawe6gaO6fjxx2mmBOI= X-Received: by 2002:a05:600c:4e92:b0:392:89ef:55c8 with SMTP id f18-20020a05600c4e9200b0039289ef55c8mr15250917wmq.69.1650375832047; Tue, 19 Apr 2022 06:43:52 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzobXzKuJ1AhNt87M8eS9DAvOgvKuzGGrqyhNwnHSrn3LyDmIhfmHAZDYjwTWvf/WQkO3fJLQ== X-Received: by 2002:a05:600c:4e92:b0:392:89ef:55c8 with SMTP id f18-20020a05600c4e9200b0039289ef55c8mr15250897wmq.69.1650375831802; Tue, 19 Apr 2022 06:43:51 -0700 (PDT) Received: from taka.site (ppp-188-174-142-87.dynamic.mnet-online.de. [188.174.142.87]) by smtp.gmail.com with ESMTPSA id t16-20020a5d5350000000b0020a99bc986bsm4451102wrv.28.2022.04.19.06.43.51 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 19 Apr 2022 06:43:51 -0700 (PDT) Received: from taka.site (localhost [IPv6:::1]) by taka.site (Postfix) with ESMTP id A10D8E; Tue, 19 Apr 2022 15:43:50 +0200 (CEST) From: Mike FABIAN To: "Emil Soleyman-Zomalan" Cc: "Florian Weimer" , libc-alpha@sourceware.org Subject: Re: [PATCH 0/2] localedata: Add locale for syr_SY [BZ #27063] Organization: Red Hat References: <20220125013310.182786-1-emil@soleyman.com> <9ae4a7ae-cdd1-4dca-a5b7-1d7f818d8a4e@www.fastmail.com> <87zgkyqd60.fsf@oldenburg.str.redhat.com> <51087442-c05d-438a-816f-53f9c3c7972e@www.fastmail.com> X-Face: "'; oPz9V1+<,`}1ZuxRv~EiSusWq*{Yjr"Sdvbhq'?q=2R\\6Y9O/,SAE`{J|6I=|w/sQg< rW_N'E3IV6~f8?\l#Es`]S`mv',PY(`8{$$R?+gLu}Qv/Mn>)?uladFjJ@yl!_p_Jh; 5QxlD6zL:?r IXe4FfK$C^mWhh$o`yt; .r.FLZLQOWBt> (Emil Soleyman-Zomalan's message of "Sat, 09 Apr 2022 12:39:41 -0500") Message-ID: User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/29.0.50 (gnu/linux) MIME-Version: 1.0 X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: redhat.com Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-2.5 required=5.0 tests=BAYES_05, BODY_8BITS, DKIMWL_WL_HIGH, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, RCVD_IN_DNSWL_LOW, RCVD_IN_MSPIKE_H4, RCVD_IN_MSPIKE_WL, SPF_HELO_NONE, SPF_NONE, TXREP, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.4 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on server2.sourceware.org X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 19 Apr 2022 13:43:57 -0000 "Emil Soleyman-Zomalan" さんはかきました: > On Wed, Apr 6, 2022, at 14:10, Emil Soleyman-Zomalan wrote: >> >> I can tell you that Microsoft Windows has had a Syriac locale since >> Windows 8 and uses the Syrian Pound as a currency but no telephone >> prefix. >> >> Given that we are a country-less and region-less people, it might make >> sense to go with a country-less locale like eo. I know that there are >> pros and cons to this but nothing is coming to mind right now. >> > > I have added the syr_SY.UTF-8.in file to bugzilla#27063. I would like > to say let's move forward with the country-less version of the Syriac > locale. I am working on adding this to glibc now. I used the locale attached to the bug and the syr_SY.UTF-8.in from the bug and renamed it to syr.UTF-8.in The sorting test case fails like this: syr.UTF-8 collate-test FAIL --- syr.UTF-8.in 2022-04-19 13:07:31.675953523 +0200 +++ /local/mfabian/src/glibc-build/localedata/syr.UTF-8.out 2022-04-19 14:42:15.977224807 +0200 @@ -1,14 +1,14 @@ ; Symbol Name Hex Code ; ------+------+---------------------------------------+--------+ -ܐ ; Syriac Letter Alaph U+0710 ܑ ; Syriac Letter Superscript Alaph U+0711 +ܐ ; Syriac Letter Alaph U+0710 ܒ ; Syriac Letter Beth U+0712 ܭ ; Syriac Letter Persian Bheth U+072D ܓ ; Syriac Letter Gamal U+0713 ܔ ; Syriac Letter Gamal Garshuni U+0714 ܮ ; Syriac Letter Persian Ghamal U+072E -ܕ ; Syriac Letter Dalath U+0715 ܖ ; Syriac Letter Dotless Dalath Rish U+0716 +ܕ ; Syriac Letter Dalath U+0715 ܯ ; Syriac Letter Persian Dhalath U+072F ܗ ; Syriac Letter He U+0717 ܘ ; Syriac Letter Waw U+0718 Your locale uses LC_COLLATE copy "iso14651_t1" END LC_COLLATE i.e. it includes the file with the default Unicode collation. Your test file has two characters in a different order. If the default Unicode collation order is OK for you, I would fix the test file accordingly (My preferred solution, I would deviate from the default only if necessary). If these two characters really should be ordered as in your test file, I would add extra rules to the LC_COLLATE section to achieve that: LC_COLLATE copy "iso14651_t1" ... more rules here ... END LC_COLLATE But that is extra effort and I wonder whether this is needed. There is no collation information in CLDR yet, no file common/collation/syr.xml exists currently in CLDR. You said you are also the contributor to the Unicode CLDR for Syriac. But you didn’t add a collation file there. If that means the default order is OK, then just doing LC_COLLATE copy "iso14651_t1" END LC_COLLATE should be enough and I should fix the test file. -- Mike FABIAN 睡眠不足はいい仕事の敵だ。