From: Thomas Wolff
To: cygwin@cygwin.com
Subject: Re: Cygwin fails to utilize Unicode replacement character
Date: Sat, 01 Sep 2018 21:07:00 -0000
Message-ID: <9def52c2-f547-8300-1faa-e71bfd3fd2d4@towo.net>
In-Reply-To: <5b8ade6a.1c69fb81.e5a2.6ed8@mx.google.com>

On 01.09.2018 at 20:46, Steven Penny wrote:
> On Sat, 1 Sep 2018 20:11:15, Thomas Wolff wrote:
>> Which terminals are used and what's the output of `locale` and `cat
>> --version` in both cases?
>
> ...
>
> Note that in addition to Linux, Windows PowerShell also gives correct
> output:
>
>    $ pwsh -c '[system.text.encoding]::UTF8.getString(0xEB)'
>    �

What makes you claim this would be the "correct output"? Where is this defined?

> compare again with Cygwin:
>
>    $ printf '\xEB'
>    ▒

Actually, in mintty this is no longer MEDIUM SHADE. Please compare.

There's also a problem with using MEDIUM SHADE. In an ambiguous-width locale (or in explicit ambiguous-width terminal mode), that character is double-width and is therefore not suitable as a replacement for a single illegal UTF-8 byte. The Cygwin console does not support double-width, so it does not have this problem, but until further clarification I don't think I'll change it in mintty.

Thomas
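
For anyone who wants to reproduce the two points discussed above outside a terminal, here is a minimal Python sketch. It assumes that Python's standard `unicodedata` tables are a fair stand-in for the East Asian Width data a terminal consults, which may not hold for any particular terminal or version. The helper names `decode_lenient` and `is_ambiguous_width` are made up for illustration. It says nothing about how mintty or the Cygwin console actually pick their replacement glyph.

```python
import unicodedata

MEDIUM_SHADE = "\u2592"  # the character discussed in this thread


def decode_lenient(raw: bytes) -> str:
    """Decode UTF-8, substituting U+FFFD for bytes that cannot be decoded."""
    return raw.decode("utf-8", errors="replace")


def is_ambiguous_width(ch: str) -> bool:
    """True if the character's East Asian Width property is 'A' (ambiguous).

    Ambiguous-width characters are the ones a terminal may render
    double-width in an ambiguous-width locale or mode.
    """
    return unicodedata.east_asian_width(ch) == "A"


if __name__ == "__main__":
    # Same lone byte as in `printf '\xEB'`.
    decoded = decode_lenient(b"\xeb")
    print("decoded code points:", [f"U+{ord(c):04X}" for c in decoded])
    print("MEDIUM SHADE ambiguous width:", is_ambiguous_width(MEDIUM_SHADE))
```

The same `is_ambiguous_width` check can be run on any candidate replacement character before adopting it, since a substitute for one illegal byte should occupy one cell in every mode.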