From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 15430 invoked by alias); 14 Dec 2017 04:40:55 -0000 Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner@cygwin.com Mail-Followup-To: cygwin@cygwin.com Received: (qmail 15422 invoked by uid 89); 14 Dec 2017 04:40:55 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=2.3 required=5.0 tests=AWL,BAYES_50,KAM_LAZY_DOMAIN_SECURITY,LIKELY_SPAM_SUBJECT,RCVD_IN_DNSWL_LOW autolearn=no version=3.3.2 spammy=calgary, Calgary, alberta, Alberta X-HELO: smtp-out-no.shaw.ca Received: from smtp-out-no.shaw.ca (HELO smtp-out-no.shaw.ca) (64.59.134.13) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Thu, 14 Dec 2017 04:40:54 +0000 Received: from [192.168.1.100] ([24.64.240.204]) by shaw.ca with SMTP id PLKEeyxEJp2osPLKFeOjOY; Wed, 13 Dec 2017 21:40:52 -0700 X-Authority-Analysis: v=2.2 cv=KLEqNBNo c=1 sm=1 tr=0 a=MVEHjbUiAHxQW0jfcDq5EA==:117 a=MVEHjbUiAHxQW0jfcDq5EA==:17 a=N659UExz7-8A:10 a=82XOVZ_bcgWAQp1Wn9YA:9 a=pILNOxqGKmIA:10 Reply-To: Brian.Inglis@SystematicSw.ab.ca Subject: Re: Need help with multibyte UTF-8 characters To: cygwin@cygwin.com References: <626a3c06-e9f2-1932-f1f3-47ddb2051215@gmail.com> <89a7d883-e3e5-42d7-2d22-020cdf6ccbdf@towo.net> From: Brian Inglis Message-ID: <8d811915-176f-c9f5-6500-364e494142f7@SystematicSw.ab.ca> Date: Thu, 14 Dec 2017 07:36:00 -0000 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.5.0 MIME-Version: 1.0 In-Reply-To: <89a7d883-e3e5-42d7-2d22-020cdf6ccbdf@towo.net> Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 7bit X-CMAE-Envelope: MS4wfHe8/By02jxeXOF1YxDkwNnKzJyv9YRmJqWlBPV42uqOg1OVFdj6c5GT7/NFQRWd4EHWeiMkPgr1rvEtFsFl604ndNfD+Rqc58EOcxGT5h6j+Lmnagar yzU5eTv51go+dg4JdmgCW6VO033sJGQv/nyEsolhorahN1WAPjKnPY4jOf6MOUbrhfAEwKheffmL/w== X-IsSubscribed: yes X-SW-Source: 2017-12/txt/msg00130.txt.bz2 On 2017-12-13 00:50, Thomas Wolff wrote: > Am 13.12.2017 um 06:21 schrieb Brian Inglis: >> On 2017-12-04 18:23, Thomas Taylor wrote: >> Your Windows Regional settings and your mintty/Options/Text/Language and >> Character Set should be set to match. >> The profile commands below set Cygwin locale to your Windows Regional settings >> and charset to UTF-8, or Unix locale to your system locale. >> Otherwise your system or mintty is going to be doing conversions on each >> character. > I am not aware that mintty character display and Windows regional settings would > interfere in any way you indicated. > Can you elaborate on this please? Maybe I'm just too optimistic that software will DTRT to ensure that output is faithfully passed thru, or converted for the next layer of software, if it has different settings. I set all of my locales the same so characters should pass thru transparently and I can see output faithfully rendered, given adequate font configurations. What happens when your system, terminal, and shell locales and charsets differ? Either some component/-s has/have to do conversion to provide readable output, which is my expectation given the requirement to specify locales and charsets, or you could end up with garbled output if nothing is doing any conversion. Does one override others to pass thru readable output, does conversion occur, or do you just see junk in some or all cases when locales and charsets differ? I am ignoring here the effect on text content, input and output formatting of selecting languages, territories, and scripts. -- Take care. Thanks, Brian Inglis, Calgary, Alberta, Canada -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple