From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 64334 invoked by alias); 9 Jan 2019 00:46:36 -0000 Mailing-List: contact libc-locales-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: libc-locales-owner@sourceware.org Received: (qmail 64312 invoked by uid 89); 9 Jan 2019 00:46:36 -0000 Authentication-Results: sourceware.org; auth=none X-Spam-SWARE-Status: No, score=-0.9 required=5.0 tests=BAYES_00,KAM_LAZY_DOMAIN_SECURITY,RCVD_IN_DNSWL_NONE autolearn=no version=3.3.2 spammy=tied, Country, country, countries X-HELO: mout.kundenserver.de Subject: Re: [PATCH v12] Locales: Cyrillic -> ASCII transliteration [BZ #2872] To: Marko Myllynen , Rafal Luzynski , libc-alpha@sourceware.org, libc-locales@sourceware.org, Carlos O'Donell , Siddhesh Poyarekar Cc: Mike Fabian References: <41532e13-a63d-5df1-ab37-05eb4d6c8d0a@kobylkin.com> <20180412224352.GB2911@altlinux.org> <2124833400.35614.1546698902753@poczta.nazwa.pl> <908ed415-cfe4-804c-f421-4351ef062edc@kobylkin.com> From: Egor Kobylkin Message-ID: <6d076299-babd-406a-b1fe-87778f54bf36@kobylkin.com> Date: Wed, 09 Jan 2019 00:46:00 -0000 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.2.1 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit X-SW-Source: 2019-q1/txt/msg00022.txt.bz2 On 07.01.19 21:37, Marko Myllynen wrote: > Hi, > > On 05/01/2019 23.12, Egor Kobylkin wrote: >> On 05.01.19 15:35, Rafal Luzynski wrote: >>> 2.01.2019 19:38 Egor Kobylkin wrote: >>>> >>>> Changelog v12: >>>> [...] >>>> >>>> Changelog v11: >>>> * Re-targeted the patch against locale/C-translit.h.in as the proper >>>> file for the ASCII translit table. >>>> * Correspondingly the patch now only contains the additional >>>> Cyrillic-ASCII strings in the format of locale/C-translit.h.in table. >>>> The 'include "translit_cyrillic";""' directives are not necessary in the >>>> locale files and they are now all left intact. >>>> * Also the file translit_cyrillic is not longer needed and is omitted. >>>> * Edited below email, commit message. >>>> [...] >>> >>> I have tested this and, unfortunately, now this transliteration >>> works *only* in C locale, that is, only when no locale is set or when >>> it is explicitly set to C (C.UTF8, POSIX).  It does not work when locale >>> is set to anything different, including en_US, ru_RU, etc. >> >> Good catch! Should we maybe split this into two patches, one for C and >> the other for "country" locales? They have different codes and >> functionality so it looks like it would be easier to keep focus. > > That would probably make sense, the standard C/POSIX locale won't > support System A so it also narrows down solution alternatives with it. > [SNIP] >> "Country" locales in localedata/locales/ can then have the exact same >> translit table included or they can have any other flavor - I don't see >> a problem here. > > Indeed, and since those files are not limited to ASCII, perhaps we could > now reconsider the v9 approach for them, i.e., prefer System A if > possible, otherwise use System B / ASCII (just need to make sure that > the ASCII fall-back for them will match the built-in C ASCII rule)? > Happy to hear the split seems to be a clear cut one. How about I rename the "[PATCH v12]...[BZ #2872]" to "[PATCH v1]... C/POSIX [BZ #2872]" and the "[PATCH v9]" gets its own bug-report (number) and title for clarity in communication? The bug report for [PATCH v9] ("Countries" locales) should then ideally have your (and others) explicit requirements as to the GOST System A/B fall-back, which countries to include etc. Again, myself I have no other req. here but just to have _any_ translit in place. This way it would probably be easier to have the decision making process tied up for both patches (separately). We may want to get the v12 POSIX out of the door in 2.30 then and can take all the time we need to set up the rules for "Countries" locales as you need them to be. Bests, Egor