From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 105173 invoked by alias); 5 Dec 2017 03:48:22 -0000 Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner@cygwin.com Mail-Followup-To: cygwin@cygwin.com Received: (qmail 105133 invoked by uid 89); 5 Dec 2017 03:48:21 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=2.3 required=5.0 tests=AWL,BAYES_50,KAM_LAZY_DOMAIN_SECURITY,LIKELY_SPAM_SUBJECT,RCVD_IN_DNSWL_LOW autolearn=no version=3.3.2 spammy=regional, H*R:D*ca, mintty, calgary X-HELO: smtp-out-so.shaw.ca Received: from smtp-out-so.shaw.ca (HELO smtp-out-so.shaw.ca) (64.59.136.138) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Tue, 05 Dec 2017 03:48:19 +0000 Received: from [192.168.1.100] ([24.64.240.204]) by shaw.ca with SMTP id M4DGepJV1ss4TM4DHencCv; Mon, 04 Dec 2017 20:48:07 -0700 X-Authority-Analysis: v=2.2 cv=JuuBlIwC c=1 sm=1 tr=0 a=MVEHjbUiAHxQW0jfcDq5EA==:117 a=MVEHjbUiAHxQW0jfcDq5EA==:17 a=N659UExz7-8A:10 a=eWUNByC9m1pz4WcI4cYA:9 a=pILNOxqGKmIA:10 Reply-To: Brian.Inglis@SystematicSw.ab.ca Subject: Re: Need help with multibyte UTF-8 characters To: cygwin@cygwin.com References: <626a3c06-e9f2-1932-f1f3-47ddb2051215@gmail.com> From: Brian Inglis Message-ID: <3317b062-e0b1-03dd-b289-97823217dd27@SystematicSw.ab.ca> Date: Tue, 05 Dec 2017 03:48:00 -0000 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.5.0 MIME-Version: 1.0 In-Reply-To: <626a3c06-e9f2-1932-f1f3-47ddb2051215@gmail.com> Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 8bit X-CMAE-Envelope: MS4wfBpSDYXIc6S+I1uEq6jYw1PxW0gsDvRCWml2img8z/QtT+RBZEPlM8TgOwk4QEDbMrtjnGFdzh4/DcQqkROa68TxmkFWmDGdcvDhOrmpipa1vp7K2VEM kZiVgWFWDy89PeI9Zbc6poYANWOAS8MYTTT9egp5NKke3ubA+7/mwRk/1/1ssVr0XdYr7yGb50Lg0g== X-IsSubscribed: yes X-SW-Source: 2017-12/txt/msg00035.txt.bz2 On 2017-12-04 18:23, Thomas Taylor wrote: > I want to use multibyte UTF-8 characters in 64-bit Cygwin under Windows 7.  The > "vim" editor running in mintty displays the two-byte characters correctly, but > not the three- (and I assume four-) byte characters, which instead display as > rectangular filled-in blocks.  The "less" program doesn't even display two-byte > characters correctly, but instead displays them as to , depending on > the character in question, in reverse color in the terminal window.  The "cat" > program is even worse, replacing every two-byte character with a character that > looks like three horizontal bars stacked one above the other.  I've read the > "Internationalization" page in the Cygwin online manual, but am still baffled.  > My LANG environment variable is set to "en_US.UTF-8".  Can anyone help? Check mintty/Options/Text/Locale[en_US]/Character set[UTF-8]/Apply/Save. Then exit and restart mintty and your shell. To see what locale Cygwin thinks you are set to, run: $ locale To check all Windows locale settings, you can run: $ for o in -s -u -n -i -f ''; do locale $o; done The first two should show your Windows install locale, the rest should show anything you have set up, or the same locale. If any settings don't match LANG, you may have to set LC_ALL=$LANG to force the setting. I use the following profile stanza across all systems for consistency: # Set user-defined locale - use regional settings if available locale -fU > /dev/null 2>&1 \ && LC_ALL=`locale -fU` \ || LC_ALL=`locale | \ /bin/sed '/^LANG=\|^LC_CTYPE=\|^LC_ALL=/{s///;h};$!d;x;s/"//g'` export LC_ALL -- Take care. Thanks, Brian Inglis, Calgary, Alberta, Canada -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple