To: cygwin@cygwin.com
From: Mark Aitchison
Subject: Perl Unidecode modules - which to use (if not Text::Unidecode)?
Date: Fri, 2 Apr 2021 09:35:31 +1300

I am writing Perl programs that I'd like to know will work under both Linux and Cygwin, and I now have to deal with Unicode.

I have used Text::Unidecode happily on Linux but can find no Cygwin version. Possibly I am not looking in the right places for it, or possibly there are other Unicode-related modules, well supported under both Cygwin and Linux, that I should be using instead. I guess Unicode might be one of those things that depend on the underlying OS, so it probably pays to go with whatever the standard set of modules is.

1. What Perl Unicode modules should I consider, if not Text::Unidecode? The present need is to convert those few "foreign" characters (like ÇĆĈĊçĉċĜĞĠĢĝģğġËÌÍÎÏÒÓÔÕ) that are basically ASCII letters with accent marks to their closest ASCII equivalents, but I'd like to do more with Unicode in the future without going down any dead ends as far as running under Cygwin is concerned.

2. I see some talk of internationalization in Chapter 2 of "Setting up Cygwin", but cannot see anything relating to Perl modules, and I don't see any easy way to search many months of the mailing list for a keyword. Is there any information I should know about?

Thanks,
Mark Aitchison
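
For illustration only, the conversion described in question 1 can be sketched with Unicode::Normalize, which is a core Perl module and so should be present wherever perl itself is installed. This is not what Text::Unidecode does: it is a swapped-in technique that decomposes each character and drops the combining accent marks. The helper name strip_accents is hypothetical, and the approach assumes the input has already been decoded into Perl character strings.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use utf8;                         # this source file contains UTF-8 literals
use Unicode::Normalize qw(NFD);   # core module, no separate install assumed

# strip_accents is a hypothetical helper, not part of any module.
sub strip_accents {
    my ($text) = @_;
    my $decomposed = NFD($text);  # "Ç" becomes "C" followed by U+0327
    $decomposed =~ s/\p{Mn}//g;   # remove the combining (non-spacing) marks
    return $decomposed;
}

binmode(STDOUT, ':encoding(UTF-8)');
print strip_accents("ÇĆĈĊçĉċĜĞĠĢĝģğġËÌÍÎÏÒÓÔÕ"), "\n";
```

Characters that have no canonical decomposition, such as ø, ß, or Ł, pass through this sketch unchanged, so it covers only the "ASCII letter plus accent" case and is not a general transliterator.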