From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 119139 invoked by alias); 10 Aug 2019 09:25:57 -0000 Mailing-List: contact libc-locales-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: libc-locales-owner@sourceware.org Received: (qmail 119029 invoked by uid 48); 10 Aug 2019 09:25:52 -0000 From: "egor at kobylkin dot com" To: libc-locales@sourceware.org Subject: [Bug localedata/12031] iconv -t ascii//translit with Greek characters Date: Sat, 10 Aug 2019 09:25:00 -0000 X-Bugzilla-Reason: AssignedTo X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: glibc X-Bugzilla-Component: localedata X-Bugzilla-Version: unspecified X-Bugzilla-Keywords: X-Bugzilla-Severity: normal X-Bugzilla-Who: egor at kobylkin dot com X-Bugzilla-Status: REOPENED X-Bugzilla-Resolution: X-Bugzilla-Priority: P2 X-Bugzilla-Assigned-To: libc-locales at sourceware dot org X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: security- X-Bugzilla-Changed-Fields: Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://sourceware.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-SW-Source: 2019-q3/txt/msg00030.txt.bz2 https://sourceware.org/bugzilla/show_bug.cgi?id=3D12031 --- Comment #16 from Egor Kobylkin --- AFAIK there are many versions of transcription tables for Greek to ASCII transcription. Given that current iconf logic can only translit one to many= but not many to many symbols we take the "Standard" part of the following table https://en.wikipedia.org/wiki/Romanization_of_Greek#Modern_Greek and only keep the one letter Greek graphems. That "standard" seems to be cl= ose to the ELOT 743 indeed but not the same.=20 So we omit things like M and =CE=9C=CF=80 being transliterated as M and B a= ccordingly. Rather =CE=9C=CF=80 will be treated like two separate graphems and translit= erated as Mp.=20 Here is the list of some standards I have collected so far. There doesn't s= eem a way to harmonize them all into one. But if anyone want to propose a solut= ion - please do. + =CE=95=CE=9B=CE=9F=CE=A4 743 https://www.teicrete.gr/users/kutrulis/Ergal= ia/ELOT743.htm Passports. + ISO 843 https://en.wikipedia.org/wiki/ISO_843 + ALA-LC https://www.loc.gov/catdir/cpso/romanization/greek.pdf Book titles. + BGN/PCGN http://libraries.ucsd.edu/bib/fed/USBGN_romanization.pdf=20 http://geonames.nga.mil/gns/html/Romanization/Romanization_Greek.pdf Geographical names. --=20 You are receiving this mail because: You are the assignee for the bug.