From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 46466 invoked by alias); 19 Nov 2018 19:36:05 -0000 Mailing-List: contact libc-locales-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: libc-locales-owner@sourceware.org Received: (qmail 46435 invoked by uid 89); 19 Nov 2018 19:36:04 -0000 Authentication-Results: sourceware.org; auth=none X-Spam-SWARE-Status: No, score=-1.0 required=5.0 tests=AWL,BAYES_00,KAM_LAZY_DOMAIN_SECURITY,RCVD_IN_DNSWL_NONE autolearn=no version=3.3.2 spammy=Hx-languages-length:1566, H*r:ip*192.168.1.101, national X-HELO: mail-wm1-f68.google.com Return-Path: Reply-To: Marko Myllynen Subject: Re: [PATCH v9] Locales: Cyrillic -> ASCII transliteration table [BZ #2872] To: Egor Kobylkin , libc-alpha@sourceware.org, libc-locales@sourceware.org References: <41532e13-a63d-5df1-ab37-05eb4d6c8d0a@kobylkin.com> <20180412224352.GB2911@altlinux.org> <837001401.21346.1542406647888@poczta.nazwa.pl> <5a247161-c498-ed50-ff4a-58f2ecf974f0@redhat.com> <29627b4c-317d-5e80-f34b-920e0eadadee@kobylkin.com> From: Marko Myllynen Message-ID: <24db2c9f-0f07-c5b7-6c16-1b7993061147@redhat.com> Date: Mon, 19 Nov 2018 19:36:00 -0000 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.3.0 MIME-Version: 1.0 In-Reply-To: <29627b4c-317d-5e80-f34b-920e0eadadee@kobylkin.com> Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-SW-Source: 2018-q4/txt/msg00116.txt.bz2 Hi, On 19/11/2018 11.21, Egor Kobylkin wrote: > On 19.11.18 08:13, Marko Myllynen wrote: >> On 17/11/2018 20.34, Egor Kobylkin wrote: > > Your example only covers _tansliteration_ to Latin Diacritics > iconv -f UTF-8 -t ISO-8859-15//TRANSLIT \ > | iconv -f ISO-8859-15 -t UTF-8 > > while BZ #2872 is about _transcription_ to ASCII > iconv -f UTF-8 -t ASCII//TRANSLIT AFAICS v9 (unlike v10) supported both of the above cases. > The glibc wiki explicitly lists this use case (ASCII) as the test > example https://sourceware.org/glibc/wiki/Locales#Testing_Locales I wrote that section and I certainly wasn't considering Cyrillic aspects at that time (IIRC it was written even before Mike did the major update for transliteration rules at the end of 2015). The context back then was mostly about handling Latin letters like Å, Ä, Ö, Ø, etc. > So again, you are asking to have ISO 9.1995. System A but the bug is > about ISO 9.1995. System B (GOST 7.79-2000) We certainly can decide here what's the best course of action, we do not have to slavishly follow some old bug report when deciding the direction for the implementation. But I think I've made my position clear by now so I'm not going to repeat it anymore. In any case once your patch lands I'm going to submit a follow-up patch for fi_FI to make it compliant with the applicable national standard (SFS 4900) which defines how to do Cyrillic transliteration / transcription in the context Finnish. Thanks, -- Marko Myllynen