From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 126495 invoked by alias); 13 Dec 2017 05:21:42 -0000 Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner@cygwin.com Mail-Followup-To: cygwin@cygwin.com Received: (qmail 126478 invoked by uid 89); 13 Dec 2017 05:21:41 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=2.3 required=5.0 tests=AWL,BAYES_50,KAM_LAZY_DOMAIN_SECURITY,LIKELY_SPAM_SUBJECT,RCVD_IN_DNSWL_LOW autolearn=no version=3.3.2 spammy=bars, calgary, Calgary, Alberta X-HELO: smtp-out-no.shaw.ca Received: from smtp-out-no.shaw.ca (HELO smtp-out-no.shaw.ca) (64.59.134.13) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Wed, 13 Dec 2017 05:21:40 +0000 Received: from [192.168.1.100] ([24.64.240.204]) by shaw.ca with SMTP id OzU9eu1ysp2osOzUAeKqG1; Tue, 12 Dec 2017 22:21:38 -0700 X-Authority-Analysis: v=2.2 cv=KLEqNBNo c=1 sm=1 tr=0 a=MVEHjbUiAHxQW0jfcDq5EA==:117 a=MVEHjbUiAHxQW0jfcDq5EA==:17 a=N659UExz7-8A:10 a=eWUNByC9m1pz4WcI4cYA:9 a=pILNOxqGKmIA:10 Reply-To: Brian.Inglis@SystematicSw.ab.ca Subject: Re: Need help with multibyte UTF-8 characters To: cygwin@cygwin.com References: <626a3c06-e9f2-1932-f1f3-47ddb2051215@gmail.com> From: Brian Inglis Message-ID: Date: Wed, 13 Dec 2017 13:07:00 -0000 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.5.0 MIME-Version: 1.0 In-Reply-To: <626a3c06-e9f2-1932-f1f3-47ddb2051215@gmail.com> Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 8bit X-CMAE-Envelope: MS4wfPRs8TM/xKe857pX3ZJQlfbUBYMZWyr/f84oH07SndPh5X262nK50f78ED1krxwIu+pJ/E4TUSq/AI/x4PPsDqyV6YZzzgceBw3M7OluO5xJH8rMCoTt 73rZOJp2XdYwZqUFshiPdAkjQLa6wmrNyq3rjyTHMww7epE26TpBhhT7URPxyZcMbaSHjuCNR3Uz3w== X-IsSubscribed: yes X-SW-Source: 2017-12/txt/msg00121.txt.bz2 On 2017-12-04 18:23, Thomas Taylor wrote: > I want to use multibyte UTF-8 characters in 64-bit Cygwin under Windows 7.  The > "vim" editor running in mintty displays the two-byte characters correctly, but > not the three- (and I assume four-) byte characters, which instead display as > rectangular filled-in blocks.  The "less" program doesn't even display two-byte > characters correctly, but instead displays them as to , depending on > the character in question, in reverse color in the terminal window.  The "cat" > program is even worse, replacing every two-byte character with a character that > looks like three horizontal bars stacked one above the other.  I've read the > "Internationalization" page in the Cygwin online manual, but am still baffled.  > My LANG environment variable is set to "en_US.UTF-8".  Can anyone help? Your Windows Regional settings and your mintty/Options/Text/Language and Character Set should be set to match. The profile commands below set Cygwin locale to your Windows Regional settings and charset to UTF-8, or Unix locale to your system locale. Otherwise your system or mintty is going to be doing conversions on each character. # Set user-defined locale locale -fU > /dev/null 2>&1 \ && LC_ALL=$(locale -fU) \ || LC_ALL=$(locale | \ sed '/^LANG=\|^LC_CTYPE=\|^LC_ALL=/{s///;h};$!d;x;s/"//g') -- Take care. Thanks, Brian Inglis, Calgary, Alberta, Canada -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple