Subject: Re: [PATCHv3] Expected behaviour for a-z, A-Z, and 0-9 (Bug 23393).
To: Rafal Luzynski, GNU C Library, Mike Fabian, Florian Weimer, "Joseph S. Myers"
References: <9d6f47ec-f9eb-ead0-889c-3b9aae66551c@redhat.com> <5bcef059-b928-d2e9-82dd-2ae68be96020@redhat.com> <541d18da-6318-382e-d5cd-6c69a5db1a07@redhat.com> <8359bdf2-457e-e2f1-ac90-e4b27b2e0495@redhat.com> <2083576605.587313.1532552765063@poczta.nazwa.pl>
From: Carlos O'Donell
Date: Wed, 25 Jul 2018 21:12:00 -0000
In-Reply-To: <2083576605.587313.1532552765063@poczta.nazwa.pl>
X-SW-Source: 2018-07/txt/msg00856.txt.bz2

On 07/25/2018 05:06 PM, Rafal Luzynski wrote:
> 25.07.2018 22:25 Carlos O'Donell wrote:
>> [...]
>> I had to fix the following locales:
>>
>> modified: localedata/locales/ar_SA
>> modified: localedata/locales/km_KH
>> modified: localedata/locales/lo_LA
>> modified: localedata/locales/or_IN
>> modified: localedata/locales/sl_SI
>> modified: localedata/locales/th_TH
>>
>> They all re-arranged ASCII character collation element ordering like tr_TR,
>> and so they needed manual fixing.
>
> Please check bg_BG. It also has a large reorder: puts all Cyrillic characters
> before Latin.
> (However, this may not be relevant at all.)

Right, that won't affect the rational range for ASCII.

The new tst-fnmatch.input has this:

bg_BG.UTF-8 "a" "[a-z]" 0
bg_BG.UTF-8 "z" "[a-z]" 0
bg_BG.UTF-8 "A" "[a-z]" NOMATCH
bg_BG.UTF-8 "Z" "[a-z]" NOMATCH
bg_BG.UTF-8 "A" "[A-Z]" 0
bg_BG.UTF-8 "Z" "[A-Z]" 0
bg_BG.UTF-8 "a" "[A-Z]" NOMATCH
bg_BG.UTF-8 "z" "[A-Z]" NOMATCH

Which tests the range extremes, and it passes.

It doesn't reorder any actual LATIN characters and so it's safe.

Cheers,
Carlos.
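
For anyone who wants to try the same eight cases outside the glibc test suite, here is a minimal sketch using the standard `fnmatch()` and `setlocale()` calls. The helper name `count_range_mismatches` and the `range_case` table are hypothetical and exist only for this illustration; the real test is driven by tst-fnmatch.input, not by code like this. It assumes the requested locale is installed on the system, and reports -1 when it is not.

```c
#include <fnmatch.h>
#include <locale.h>
#include <stddef.h>

/* One row of the excerpt: string, pattern, expected fnmatch() result. */
struct range_case {
    const char *string;
    const char *pattern;
    int expected;               /* 0 for a match, FNM_NOMATCH otherwise */
};

/* The eight bg_BG.UTF-8 rows quoted in the message. */
static const struct range_case range_cases[] = {
    { "a", "[a-z]", 0 },
    { "z", "[a-z]", 0 },
    { "A", "[a-z]", FNM_NOMATCH },
    { "Z", "[a-z]", FNM_NOMATCH },
    { "A", "[A-Z]", 0 },
    { "Z", "[A-Z]", 0 },
    { "a", "[A-Z]", FNM_NOMATCH },
    { "z", "[A-Z]", FNM_NOMATCH },
};

/* Hypothetical helper (not part of glibc): select LOCALE_NAME, run every
   row through fnmatch(), and return how many rows disagree with the
   expected result.  Returns -1 if the locale cannot be selected.
   Note that the process locale stays changed after the call.  */
int count_range_mismatches(const char *locale_name)
{
    if (setlocale(LC_ALL, locale_name) == NULL)
        return -1;

    int mismatches = 0;
    size_t n = sizeof range_cases / sizeof range_cases[0];
    for (size_t i = 0; i < n; i++) {
        int got = fnmatch(range_cases[i].pattern, range_cases[i].string, 0);
        if (got != range_cases[i].expected)
            mismatches++;
    }
    return mismatches;
}
```

Calling `count_range_mismatches("bg_BG.UTF-8")` on a glibc that carries this patch should return 0 if the behaviour matches the excerpt above; a return of -1 only means the locale is not available on that machine.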