From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 121470 invoked by alias); 25 Jun 2018 01:31:08 -0000 Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner@cygwin.com Mail-Followup-To: cygwin@cygwin.com Received: (qmail 121324 invoked by uid 89); 25 Jun 2018 01:30:50 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-4.2 required=5.0 tests=BAYES_50,GIT_PATCH_2 autolearn=ham version=3.3.2 spammy=UD:uk, H*r:8.14.7, UD:co.uk, OVER X-HELO: Ishtar.sc.tlinx.org Received: from ishtar.tlinx.org (HELO Ishtar.sc.tlinx.org) (173.164.175.65) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Mon, 25 Jun 2018 01:30:48 +0000 Received: from [192.168.3.12] (Athenae [192.168.3.12]) by Ishtar.sc.tlinx.org (8.14.7/8.14.4/SuSE Linux 0.8) with ESMTP id w5P1UPG1016608 for ; Sun, 24 Jun 2018 18:30:27 -0700 Message-ID: <5B3045B1.4080504@tlinx.org> Date: Mon, 25 Jun 2018 09:56:00 -0000 From: L A Walsh User-Agent: Thunderbird MIME-Version: 1.0 To: cygwin@cygwin.com Subject: Re: UTF-8 character encoding References: <1183751257.20180621042620@yandex.ru> In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-IsSubscribed: yes X-SW-Source: 2018-06/txt/msg00269.txt.bz2 Lee wrote: > So... keep it simple, set > LANG=en_US.UTF-8 > and use vi or something else that comes with cygwin to create the file > and I'll have a file with UTF-8 character encoding - correct? --- The first 127 characters of UTF-8 are identical to the first 127 characters of ASCII, and latin1 and iso-8859-1. If you don't use any characters that need accents or special symbols, then nothing will be encoded in UTF-8, because its only the characters OVER the first 127 (see chart @ http://www.babelstone.co.uk/Unicode/babelmap.html). The site also has a sw util (http://www.babelstone.co.uk/Software/BabelMap.html), that displays and helps config fonts to display all the characters in unicode, though it hasn't been updated to the changes that came out last month or so (Unicode 11). It's a cool little, *free*, utility...though if you find it useful you can always send in your registration. -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple