From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 22508 invoked by alias); 26 Nov 2019 10:11:30 -0000 Mailing-List: contact libc-locales-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: libc-locales-owner@sourceware.org Received: (qmail 22461 invoked by uid 89); 26 Nov 2019 10:11:30 -0000 Authentication-Results: sourceware.org; auth=none X-Spam-SWARE-Status: No, score=-1.3 required=5.0 tests=AWL,BAYES_00,BODY_8BITS,GARBLED_BODY,KAM_SHORT,MIME_QP_LONG_LINE,RCVD_IN_DNSWL_NONE,SPF_HELO_PASS,SPF_PASS autolearn=no version=3.3.1 spammy=ETA, heels, geographical, UD:gr X-HELO: mail-40134.protonmail.ch Date: Tue, 26 Nov 2019 10:11:00 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=kobylkin.com; s=protonmail; t=1574763082; bh=qBkqqzWJ/Lb7Atl419KsVpaTwx+MKbllwJXJM3vHaFA=; h=Date:To:From:Cc:Reply-To:Subject:Feedback-ID:From; b=aaOiMGC39QzhAO3KUKptlAgb1Vo5GRjvO20wOL/8S6kuy+tKioSRLMP+TGfqZpkjz p+z9pe1/Ph27L7hGLio+xRQXInFtbkoDWH3KFjnIzfgpwQOfzNkT7YUSfXc9hJr2yf o03BJz56AU9QTKZV1eVNI1izcaMB+/fUXzFOFjJk= To: "libc-locales@sourceware.org" , "libc-alpha@sourceware.org" From: "Diego (Egor) Kobylkin" Cc: Florian Weimer , Marko Myllynen Reply-To: "Diego (Egor) Kobylkin" Subject: [PING] [PATCHv3] locale/C-translit.h.in: Greek -> ASCII transliteration table [BZ #12031] Message-ID: <6GTlcJhWmGNcheqlbQqj3X0yaAGJ_UF3TKKbh8nLc4ldwZ-HG1j6a_ztKLcsQykq3CyjI6vjHCfH4GdbKPd7SYIBhl-P69wZD0GQ5MYs044=@kobylkin.com> MIME-Version: 1.0 Content-Type: multipart/signed; protocol="application/pgp-signature"; micalg=pgp-sha512; boundary="---------------------2f1a33f374d8dd6a1d5f65b8f435c0a4"; charset=UTF-8 X-SW-Source: 2019-q4/txt/msg00083.txt.bz2 This is an OpenPGP/MIME signed message (RFC 4880 and 3156) -----------------------2f1a33f374d8dd6a1d5f65b8f435c0a4 Content-Type: multipart/mixed;boundary=---------------------e7882326fb39ca37cf1cd2d18f474420 -----------------------e7882326fb39ca37cf1cd2d18f474420 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain;charset=utf-8 Content-length: 4175 ping =E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90 Original Me= ssage =E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90 On Thursday, November 14, 2019 1:40 PM, Florian Weimer = wrote: > * Diego Kobylkin: >=20 > > -# https://www.gnu.org/licenses/. > > +# http://www.gnu.org/licenses/. >=20 > Spurious change. >=20 > I think we should take this=E2=80=94even if imperfect, it beats all those= ?. >=20 > Thanks, > Florian =E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90 Original Me= ssage =E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90 On Thursday, November 14, 2019 2:14 PM, Diego (Egor) Kobylkin wrote: > Changelog: >=20 > v3 >=20 > - spurious change (https/http) removed >=20=20=20=20=20 > v2 >=20=20=20=20=20 > - ETA WITH TONOS is now transliterated as I/i to be consistent througho= ut the table. Ancient Greek calls for E/e and modern for I/i which we are t= aking here. >=20=20=20=20=20 > =E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90 Origi= nal Message =E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90 > On Thursday, November 14, 2019 1:29 PM, Diego (Egor) Kobylkin egor@ko= bylkin.com wrote: >=20=20=20=20=20 >=20 > > Changelog: > > v2 > >=20 > > - ETA WITH TONOS is now transliterated as I/i to be consistent throug= hout the table. Ancient Greek calls for E/e and modern for I/i which we are= taking here. > > Thanks Florian for the feedback on this! > > Egor > > =E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90 Ori= ginal Message =E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80=90=E2=80= =90 > > On Wednesday, September 4, 2019 9:31 AM, Diego (Egor) Kobylkin egor= @kobylkin.com wrote: > >=20=20=20=20=20 > >=20 > > > Dear locale maintainers, > > > fix the glibc bug 12031 "iconv -t ascii//translit with Greek characte= rs" [1] > > > add Greek transliteration rows to locale/C-translit.h.in. > > > This work is done on the heels of the successfully committed patch fo= r the > > > virtually the same bug [BZ #2872] but concerning Cyrillic characters.= [2] > > > AFAIK there are many versions of transcription tables for Greek to AS= CII > > > transcription. Given that current iconv logic can only translit one t= o many > > > but not many to many symbols we take the "Standard" part of > > > the Romanization_of_Greek#Modern_Greek table [3] > > > and only keep the one letter Greek graphems. That "standard" seems to= be close to > > > the ELOT 743 indeed but not the same. > > > So we omit things like M and =CE=9C=CF=80 being transliterated as M a= nd B accordingly. > > > Rather =CE=9C=CF=80 will be treated like two separate graphems and tr= ansliterated as Mp. > > > Here is the list of some standards I have collected so far. There doe= sn't seem > > > a way to harmonize them all into one. But if anyone want to propose a= solution - > > > please do. > > >=20 > > > - =CE=95=CE=9B=CE=9F=CE=A4 743 https://www.teicrete.gr/users/kutrul= is/Ergalia/ELOT743.htm Passports. > > >=20=20=20=20=20 > > > - ISO 843 https://en.wikipedia.org/wiki/ISO_843 > > >=20=20=20=20=20 > > > - ALA-LC https://www.loc.gov/catdir/cpso/romanization/greek.pdf Boo= k titles. > > >=20=20=20=20=20 > > > - BGN/PCGN http://libraries.ucsd.edu/bib/fed/USBGN_romanization.pdf > > >=20=20=20=20=20 > > > - http://geonames.nga.mil/gns/html/Romanization/Romanization_Greek.= pdf Geographical names. > > > Furthermore to cover the whole U0370-U03FF Greek/Coptic Unicode r= ange I have > > > asked around and made a best effort transliteration for the rest = of characters > > > not covered in above standards. > > > Should you have better sources for the actual translit entries pl= ease make sure to > > > send your feedback! > > > The patch is attached. > > > Best regards, > > > Egor Kobylkin > > > https://sourceware.org/bugzilla/show_bug.cgi?id=3D12031 [1] > > > https://sourceware.org/ml/libc-alpha/2019-07/msg00477.html [2] > > > https://en.wikipedia.org/wiki/Romanization_of_Greek#Modern_Greek = [3] > > > -----------------------e7882326fb39ca37cf1cd2d18f474420 Content-Type: application/pgp-keys; filename="publickey - egor@kobylkin.com - 0x01FEB4E8.asc"; name="publickey - egor@kobylkin.com - 0x01FEB4E8.asc" Content-Transfer-Encoding: base64 Content-Disposition: attachment; filename="publickey - egor@kobylkin.com - 0x01FEB4E8.asc"; name="publickey - egor@kobylkin.com - 0x01FEB4E8.asc" Content-length: 891 LS0tLS1CRUdJTiBQR1AgUFVCTElDIEtFWSBCTE9DSy0tLS0tDQpWZXJzaW9u OiBPcGVuUEdQLmpzIHY0LjYuMg0KQ29tbWVudDogaHR0cHM6Ly9vcGVucGdw anMub3JnDQoNCnhqTUVYTGN4NkJZSkt3WUJCQUhhUnc4QkFRZEFUYVpYRStO US9ZYXJYRk9jTEhJQk9DSWJ6TXNnNXpQZQ0KSTZ5VzR4OHBQVlhOSnlKbFoy OXlRR3R2WW5sc2EybHVMbU52YlNJZ1BHVm5iM0pBYTI5aWVXeHJhVzR1DQpZ Mjl0UHNKM0JCQVdDZ0FmQlFKY3R6SG9CZ3NKQndnREFnUVZDQW9DQXhZQ0FR SVpBUUliQXdJZUFRQUsNCkNSQStPcVNEZ0FHcG9acmVBUDlOTUdxMXZ1UVJi Y1hBbGhZbStvRU9XMGVWYXRyK0RJcDRBdGJoYzdkZw0KUUFFQXA1NjBKMFEz RHpmK1BKY1pDdFBHeERlOWZWVkZyelBYUzN3MTBYN00wd2ZPT0FSY3R6SG9F Z29yDQpCZ0VFQVpkVkFRVUJBUWRBb2RSbXRLSDkwV0ZMZzlwTHloS0c2b0Rv ZWpIdWhjOEd0eTROSXlhRUxtd0QNCkFRZ0h3bUVFR0JZSUFBa0ZBbHkzTWVn Q0d3d0FDZ2tRUGpxa2c0QUJxYUVtc2dFQTZnSWdWQ29jMVp0cw0KWWMyNVh6 MEtVWXNuMWtPNEZxZmwyd2pQNzVUYkxYZ0EvQW9odWdlc2xXZVFsRTdUQ2Fh U3hFV0RXL2xYDQo4SmRlTEo4dFlIZFEvNU1MDQo9T0JwMQ0KLS0tLS1FTkQg UEdQIFBVQkxJQyBLRVkgQkxPQ0stLS0tLQ0K -----------------------e7882326fb39ca37cf1cd2d18f474420-- -----------------------2f1a33f374d8dd6a1d5f65b8f435c0a4 Content-Type: application/pgp-signature; name="signature.asc" Content-Description: OpenPGP digital signature Content-Disposition: attachment; filename="signature.asc" Content-length: 217 -----BEGIN PGP SIGNATURE----- Version: ProtonMail wl4EARYKAAYFAl3c+kEACgkQPjqkg4ABqaH5cAEA4v4/c0hbWVPKLNDAYVUB ZPT/oCQ5pcBiHOD07mdgsMMBAKDMjCN2DONkfF3giWS0JiQSx6s1Y/06yHgO 4b1+JaMG =zIUO -----END PGP SIGNATURE----- -----------------------2f1a33f374d8dd6a1d5f65b8f435c0a4--