From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (qmail 93524 invoked by alias); 8 Nov 2019 12:36:55 -0000
Mailing-List: contact libc-locales-help@sourceware.org; run by ezmlm
Precedence: bulk
List-Id:
List-Subscribe:
List-Post:
List-Help: ,
Sender: libc-locales-owner@sourceware.org
Received: (qmail 93505 invoked by uid 89); 8 Nov 2019 12:36:55 -0000
Authentication-Results: sourceware.org; auth=none
X-Spam-SWARE-Status: No, score=-0.9 required=5.0 tests=AWL,BAYES_00,BODY_8BITS,GARBLED_BODY,KAM_SHORT,RCVD_IN_DNSWL_LOW,SPF_HELO_PASS,SPF_PASS autolearn=no version=3.3.1 spammy=UD:gr, heels, Greek, tuesday
X-HELO: mail-40132.protonmail.ch
Date: Fri, 08 Nov 2019 12:36:00 -0000
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=kobylkin.com; s=protonmail; t=1573216608;
 bh=TLPtEkrXHXAWTtgjIaSKFMDZwaRCjQnqgZ+6jhqEqx4=;
 h=Date:To:From:Cc:Reply-To:Subject:Feedback-ID:From;
 b=lsF1ubpb3MUfNR4zhfsOKMTj8imiJEd/PVZgUSFpAJkEmVyj0KHlhuf1iOlICvTbm
 MuQ0fwnHnUX/9OQLjFLSYkyw4iUL4jGxSWKY/ACm06Md0g+Dn487cs+FfMBlB87N4/
 2pJSeuJ5F9Vw9mSR32S4CE+ZU/DQnb8JLrtVfGpM=
To: "libc-locales@sourceware.org" , "libc-alpha@sourceware.org"
From: "Diego (Egor) Kobylkin"
Cc: Carlos O'Donell , Rafal Luzynski , Marko Myllynen , Florian Weimer
Reply-To: "Diego (Egor) Kobylkin"
Subject: [PING^3] Re: [PATCH] locale/C-translit.h.in: Greek -> ASCII transliteration table [BZ #12031]
Message-ID:
MIME-Version: 1.0
Content-Type: multipart/signed; protocol="application/pgp-signature"; micalg=pgp-sha512; boundary="---------------------8d40b1d6e0f8685e158ec837510a2036"; charset=UTF-8
X-SW-Source: 2019-q4/txt/msg00057.txt.bz2

This is an OpenPGP/MIME signed message (RFC 4880 and 3156)
-----------------------8d40b1d6e0f8685e158ec837510a2036
Content-Type: multipart/mixed;boundary=---------------------c70c6a6324a445e41ea11376520028cc

-----------------------c70c6a6324a445e41ea11376520028cc
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;charset=utf-8
Content-length: 2945

‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
On Thursday, October 24, 2019 8:40 AM, Diego (Egor) Kobylkin wrote:

> ‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
> On Tuesday, October 15, 2019 1:26 PM, Diego (Egor) Kobylkin egor@kobylkin.com wrote:
>
> > ‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
> > On Wednesday, September 4, 2019 9:31 AM, Diego (Egor) Kobylkin egor@kobylkin.com wrote:
> >
> > > Dear locale maintainers,
> > > this patch fixes glibc bug 12031 "iconv -t ascii//translit with Greek characters" [1]
> > > by adding Greek transliteration rows to locale/C-translit.h.in.
> > > This work follows on the heels of the successfully committed patch for
> > > virtually the same bug [BZ #2872], which concerned Cyrillic characters. [2]
> > > AFAIK there are many competing tables for Greek-to-ASCII
> > > transcription. Given that the current iconv logic can only transliterate
> > > one symbol to many, but not many to many, we take the "Standard" part of
> > > the Romanization_of_Greek#Modern_Greek table [3]
> > > and only keep the one-letter Greek graphemes. That "standard" does seem to be
> > > close to ELOT 743, but it is not identical.
> > > So we omit things like M and Μπ being transliterated as M and B respectively.
> > > Rather, Μπ will be treated as two separate graphemes and transliterated as Mp.
> > > Here is the list of the standards I have collected so far. There doesn't seem
> > > to be a way to harmonize them all into one. But if anyone wants to propose a
> > > solution, please do.
> > >
> > > - ΕΛΟΤ 743 https://www.teicrete.gr/users/kutrulis/Ergalia/ELOT743.htm Passports.
> > > - ISO 843 https://en.wikipedia.org/wiki/ISO_843
> > > - ALA-LC https://www.loc.gov/catdir/cpso/romanization/greek.pdf Book titles.
> > > - BGN/PCGN http://libraries.ucsd.edu/bib/fed/USBGN_romanization.pdf
> > > - http://geonames.nga.mil/gns/html/Romanization/Romanization_Greek.pdf Geographical names.
> > >
> > > Furthermore, to cover the whole U+0370-U+03FF Greek/Coptic Unicode range, I have
> > > asked around and made a best-effort transliteration for the remaining characters
> > > not covered by the above standards.
> > > Should you have better sources for the actual translit entries, please make sure to
> > > send your feedback!
> > > The patch is attached.
> > > Best regards,
> > > Egor Kobylkin
> > > https://sourceware.org/bugzilla/show_bug.cgi?id=12031 [1]
> > > https://sourceware.org/ml/libc-alpha/2019-07/msg00477.html [2]
> > > https://en.wikipedia.org/wiki/Romanization_of_Greek#Modern_Greek [3]

-----------------------c70c6a6324a445e41ea11376520028cc
Content-Type: application/pgp-keys; filename="publickey - egor@kobylkin.com - 0x01FEB4E8.asc"; name="publickey - egor@kobylkin.com - 0x01FEB4E8.asc"
Content-Transfer-Encoding: base64
Content-Disposition: attachment; filename="publickey - egor@kobylkin.com - 0x01FEB4E8.asc"; name="publickey - egor@kobylkin.com - 0x01FEB4E8.asc"
Content-length: 891

LS0tLS1CRUdJTiBQR1AgUFVCTElDIEtFWSBCTE9DSy0tLS0tDQpWZXJzaW9u
OiBPcGVuUEdQLmpzIHY0LjYuMg0KQ29tbWVudDogaHR0cHM6Ly9vcGVucGdw
anMub3JnDQoNCnhqTUVYTGN4NkJZSkt3WUJCQUhhUnc4QkFRZEFUYVpYRStO
US9ZYXJYRk9jTEhJQk9DSWJ6TXNnNXpQZQ0KSTZ5VzR4OHBQVlhOSnlKbFoy
OXlRR3R2WW5sc2EybHVMbU52YlNJZ1BHVm5iM0pBYTI5aWVXeHJhVzR1DQpZ
Mjl0UHNKM0JCQVdDZ0FmQlFKY3R6SG9CZ3NKQndnREFnUVZDQW9DQXhZQ0FR
SVpBUUliQXdJZUFRQUsNCkNSQStPcVNEZ0FHcG9acmVBUDlOTUdxMXZ1UVJi
Y1hBbGhZbStvRU9XMGVWYXRyK0RJcDRBdGJoYzdkZw0KUUFFQXA1NjBKMFEz
RHpmK1BKY1pDdFBHeERlOWZWVkZyelBYUzN3MTBYN00wd2ZPT0FSY3R6SG9F
Z29yDQpCZ0VFQVpkVkFRVUJBUWRBb2RSbXRLSDkwV0ZMZzlwTHloS0c2b0Rv
ZWpIdWhjOEd0eTROSXlhRUxtd0QNCkFRZ0h3bUVFR0JZSUFBa0ZBbHkzTWVn Q0d3d0FDZ2tRUGpxa2c0QUJxYUVtc2dFQTZnSWdWQ29jMVp0cw0KWWMyNVh6 MEtVWXNuMWtPNEZxZmwyd2pQNzVUYkxYZ0EvQW9odWdlc2xXZVFsRTdUQ2Fh U3hFV0RXL2xYDQo4SmRlTEo4dFlIZFEvNU1MDQo9T0JwMQ0KLS0tLS1FTkQg UEdQIFBVQkxJQyBLRVkgQkxPQ0stLS0tLQ0K -----------------------c70c6a6324a445e41ea11376520028cc-- -----------------------8d40b1d6e0f8685e158ec837510a2036 Content-Type: application/pgp-signature; name="signature.asc" Content-Description: OpenPGP digital signature Content-Disposition: attachment; filename="signature.asc" Content-length: 217 -----BEGIN PGP SIGNATURE----- Version: ProtonMail wl4EARYKAAYFAl3FYVsACgkQPjqkg4ABqaEfRAEAvHUD1XNGsIKEZUNyihAu B6GUoIdNx6OP7Wb/oKG3qyYA/0lKrMYjUxiqopUBFbwH30KYbRuiNisZQO9t n9vsFD0J =1W5z -----END PGP SIGNATURE----- -----------------------8d40b1d6e0f8685e158ec837510a2036--