public inbox for libc-locales@sourceware.org
 help / color / mirror / Atom feed
From: "Diego (Egor) Kobylkin" <egor@kobylkin.com>
To: Rafal Luzynski <digitalfreak@lingonborough.com>
Cc: Carlos O'Donell <codonell@redhat.com>,
	Marko Myllynen <myllynen@redhat.com>,
	"libc-alpha@sourceware.org" <libc-alpha@sourceware.org>,
	"libc-locales@sourceware.org" <libc-locales@sourceware.org>,
	Siddhesh Poyarekar <siddhesh@gotplt.org>,
	Mike Fabian <mfabian@redhat.com>
Subject: Re: [PING^8][PATCH v12] Locales: Cyrillic -> ASCII transliteration [BZ #2872]
Date: Mon, 17 Jun 2019 08:59:00 -0000	[thread overview]
Message-ID: <6wv51loHLfKSGkdpVWewwuBG_P28BI2WZGNfStpjj6TdejdGYIhMynrzeLfh63mRjURnIVT7eciFtrgidjwlhJd7hWg-TfByCOUo5AjCZ4w=@kobylkin.com> (raw)
In-Reply-To: <1728627823.1766022.1560206406480@poczta.nazwa.pl>


[-- Attachment #1.1: Type: text/plain, Size: 2857 bytes --]


Carlos, 


we seem to have a consensus of all involved that the patch can be committed as is. 

Do you see it like this on your side as well or are there any more questions or suggestions?

Bests,
Egor

P.S. Just a clarification to Rafal points below and thanks @Rafal for the intensive "peer review" so far!
It definitely looks to me like we finally don't have any more divergent points after all the issues discussed.


‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
On Tuesday, June 11, 2019 12:40 AM, Rafal Luzynski <digitalfreak@lingonborough.com> wrote:
...
> 7.06.2019 14:59 "Diego (Egor) Kobylkin" egor@kobylkin.com wrote:
> > But the target system doesn't support Russian locale and so you must
> > transliterate the filenames.
> 

> While talking about the filesystem: I think the problem is not
> that it does not support Russian locale but that it tries to
> handle it and fails at this. If the filesystem accepted any
> byte string as a file name wouldn't it accept a byte string which
> constructs correct Cyrillic characters in UTF-8, without any
> transliteration?

Just to clarify here - the need to transliterate is the essential part in this example, not the actual cause of that need. 

A lot of "things" don't support UTF-8 or Cyrillic - filesystems, some UNIX power tools, older network appliances, databases, key-value stores etc. We are talking about a situation where you are forced to transliterate to ASCII. So that requirement is a given. 


...
> 

> > In glibc we don't have any framework for an intelligent conversion.
> > We would have to write specific code to handle this case and add
> > it into the translit code for special handling in this case.
> 

> My suggestion was to add such an intelligent conversion. The rule
> should be simple: if a letter is followed by a lowercase it should
> be a titlecase (Sh), otherwise it should be uppercase (SH). But
> this may break Egor's requirement to keep them always uppercase.

Again for the record my "requirement" is to have a minimal patch committed sooner than later. It turned out surprisingly difficult to keep our focus even on a single flat mapping table that the ASCII transliteration really is. 


> 

> > I think we should today leave "Ш"->"SH" and "Сх"->"Sh", since it's
> > the most conservative position that avoids ambiguity, and then we
> > can discuss the aesthetics of this and the other impacts and solutions.
> > I appreciate Rafal's position, but I think being conservative here,
> > even if it's not as pretty as uconv, is a good guiding idea.
> 

> Just to summarize: if you want to apply the relaxed rules, more
> technical than linguistic, then I am more willing to accept these
> patches.

The great thing is that we seem to have a consensus now and can proceed.



[-- Attachment #1.2: publickey - egor@kobylkin.com - 0x01FEB4E8.asc --]
[-- Type: application/pgp-keys, Size: 657 bytes --]

[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 249 bytes --]

      reply	other threads:[~2019-06-17  8:59 UTC|newest]

Thread overview: 13+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2019-06-05  6:47 Diego (Egor) Kobylkin
2019-06-05 23:51 ` Rafal Luzynski
2019-06-06  9:42   ` Marko Myllynen
2019-06-06 21:31     ` Rafal Luzynski
2019-06-07  0:58       ` Carlos O'Donell
2019-06-07  9:46         ` Diego (Egor) Kobylkin
2019-06-07 11:11           ` Rafal Luzynski
2019-06-07 10:51         ` Rafal Luzynski
2019-06-07 12:36           ` Carlos O'Donell
2019-06-07 13:00             ` Diego (Egor) Kobylkin
2019-06-07 21:17               ` Carlos O'Donell
2019-06-10 22:44             ` Rafal Luzynski
2019-06-17  8:59               ` Diego (Egor) Kobylkin [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='6wv51loHLfKSGkdpVWewwuBG_P28BI2WZGNfStpjj6TdejdGYIhMynrzeLfh63mRjURnIVT7eciFtrgidjwlhJd7hWg-TfByCOUo5AjCZ4w=@kobylkin.com' \
    --to=egor@kobylkin.com \
    --cc=codonell@redhat.com \
    --cc=digitalfreak@lingonborough.com \
    --cc=libc-alpha@sourceware.org \
    --cc=libc-locales@sourceware.org \
    --cc=mfabian@redhat.com \
    --cc=myllynen@redhat.com \
    --cc=siddhesh@gotplt.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).