From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 83303 invoked by alias); 1 Sep 2018 18:46:06 -0000 Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner@cygwin.com Mail-Followup-To: cygwin@cygwin.com Received: (qmail 83293 invoked by uid 89); 1 Sep 2018 18:46:06 -0000 Authentication-Results: sourceware.org; auth=none X-Spam-SWARE-Status: No, score=-1.8 required=5.0 tests=AWL,BAYES_00,FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS autolearn=ham version=3.3.2 spammy=H*M:1c69fb81, H*M:google, HX-Received:Sat, HX-Received:Sep X-HELO: mail-oi0-f68.google.com Received: from mail-oi0-f68.google.com (HELO mail-oi0-f68.google.com) (209.85.218.68) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Sat, 01 Sep 2018 18:46:04 +0000 Received: by mail-oi0-f68.google.com with SMTP id m11-v6so27317157oic.2 for ; Sat, 01 Sep 2018 11:46:04 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=message-id:date:from:subject:to:references:user-agent; bh=Us8mk7WqmYqbrw7AzAWVaxh/0l62CR5xFTddl9Iri1k=; b=sxiXvka+kdl8jiJDb7meN50McM4VdzkFSo3fVPZnhbo8jCVK5fCWwPAC/OQsksnkdy ZRWnW5NeaisLsJaBENMU+l63hWP9qcaCCL1x19QaKd0foAszO8UR/weKXp1TRF9FO4he 5d+6qltfIQiRl4hskW2v0vYBXd+weqTDO1FNKzHeug0POyT4qmhDrdNG9L6fRx8T3RL3 EfPezQXpOhOjAI2X0PxbFM9ZvI8ompiJeMMLrKYFbu8JYT1E0wt544QMnXe2VnB7aW5k rUjZb+SdN7/rQEtrK1G+x6uZwi7kAcVAEXludbLjWpFXw97kVSLBiwG6lAXMAPyR7Zd6 d5iQ== Return-Path: Received: from BkG (cpe-76-182-193-237.tx.res.rr.com. [76.182.193.237]) by smtp.gmail.com with ESMTPSA id i204-v6sm18784281oia.41.2018.09.01.11.46.01 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Sat, 01 Sep 2018 11:46:02 -0700 (PDT) Message-ID: <5b8ade6a.1c69fb81.e5a2.6ed8@mx.google.com> Date: Sat, 01 Sep 2018 18:46:00 -0000 From: Steven Penny Subject: Re: Cygwin fails to utilize Unicode replacement character To: cygwin@cygwin.com References: Content-Type: text/plain; charset=utf8; format=flowed User-Agent: Tryst/2.8.0 (cup.github.io/tryst) X-SW-Source: 2018-09/txt/msg00017.txt.bz2 On Sat, 1 Sep 2018 20:11:15, Thomas Wolff wrote: > Which terminals are used and what's the output of `locale` and `cat > --version` in both cases? Linux: $ echo "$TERM" xterm-256color $ locale LANG=en_US.UTF-8 LC_CTYPE="en_US.UTF-8" LC_NUMERIC="en_US.UTF-8" LC_TIME="en_US.UTF-8" LC_COLLATE=C LC_MONETARY="en_US.UTF-8" LC_MESSAGES="en_US.UTF-8" LC_PAPER="en_US.UTF-8" LC_NAME="en_US.UTF-8" LC_ADDRESS="en_US.UTF-8" LC_TELEPHONE="en_US.UTF-8" LC_MEASUREMENT="en_US.UTF-8" LC_IDENTIFICATION="en_US.UTF-8" LC_ALL= $ cat --version cat (GNU coreutils) 8.29 Cygwin: $ echo "$TERM" cygwin $ locale LANG=en_US.UTF-8 LC_CTYPE="en_US.UTF-8" LC_NUMERIC="en_US.UTF-8" LC_TIME="en_US.UTF-8" LC_COLLATE="C" LC_MONETARY="en_US.UTF-8" LC_MESSAGES="en_US.UTF-8" LC_ALL= $ cat --version cat (GNU coreutils) 8.26 Note that in addition to Linux, Windows PowerShell also gives correct output: $ pwsh -c '[system.text.encoding]::UTF8.getString(0xEB)' � compare again with Cygwin: $ printf '\xEB' ▒ -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple