From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 85091 invoked by alias); 4 Sep 2018 19:53:23 -0000 Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner@cygwin.com Mail-Followup-To: cygwin@cygwin.com Received: (qmail 85076 invoked by uid 89); 4 Sep 2018 19:53:22 -0000 Authentication-Results: sourceware.org; auth=none X-Spam-SWARE-Status: No, score=-1.9 required=5.0 tests=AWL,BAYES_00,FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS autolearn=ham version=3.3.2 spammy=Square, opinion, sides, 2009 X-HELO: mail-oi0-f65.google.com Received: from mail-oi0-f65.google.com (HELO mail-oi0-f65.google.com) (209.85.218.65) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Tue, 04 Sep 2018 19:53:21 +0000 Received: by mail-oi0-f65.google.com with SMTP id k81-v6so9112794oib.9 for ; Tue, 04 Sep 2018 12:53:21 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=message-id:date:from:subject:to:references:user-agent; bh=HOqnt32zvRBH59Zo7BYjpcDbknrujQCvk8JqT1zvtZ4=; b=Vizsqlima6o74YamZxL0GaEey1lFxazaI5WAF/rNMIBosG8KelWZfv4Acuou1XcKGN ldNCdIa+iY0DDBVgNSAIx6sCpg0ipNYWazFSAS+rT7HBjzJStevkF2YtoOIQaN65dEnr YCNFv0ql9pziFnDqYQO6xAZ6WB0cp14VHoHRhJG5L6gXVEkyyQ0buijSEt7cDLwnnB0L 7Ajuex05xdldrECHwZPzjTrXRJ8JR8Oo7k/kSayMHOPxKROZEKz8AJp99rI0zEQDUrzy FHHrqrkKE4u0icWarKVv1CXpNsRopv2uutZt5AnvGH0SBSMsakh1W1Ts8vwJu8+dEJwK HlZA== Return-Path: Received: from tsk (cpe-76-182-193-237.tx.res.rr.com. [76.182.193.237]) by smtp.gmail.com with ESMTPSA id t141-v6sm41037447oif.18.2018.09.04.12.53.18 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 04 Sep 2018 12:53:18 -0700 (PDT) Message-ID: <5b8ee2ae.1c69fb81.7f961.3c7d@mx.google.com> Date: Tue, 04 Sep 2018 19:53:00 -0000 From: Steven Penny Subject: Re: Cygwin fails to utilize Unicode replacement character To: cygwin@cygwin.com References: <4a728822-3c4f-c99f-51cd-63822445aa18@towo.net> Content-Type: text/plain; charset=utf8; format=flowed User-Agent: Tryst/2.8.0 (cup.github.io/tryst) X-SW-Source: 2018-09/txt/msg00075.txt.bz2 On Tue, 4 Sep 2018 20:41:48, Thomas Wolff wrote: > No idea what you consider dangerous. Anyway, we obviously agree that > hardly any available console font supports the REPLACEMENT CHARACTER. > You had previously suggested code that might work (using CreateFont(0, > 0, ....)). Maybe you can sort out with Corinna how to get that work > inside cygwin. Otherwise, my opinion: > - *working* fallback from FFFD to 2592: good i am fine with this, but i think corinna feels it is too much code for not enough benefit - thats her decision. > - fix FFFD: not good, because the .notdef glyph is not an appropriate > indication of illegal encoding (like broken UTF-8 bytes) not sure what you even mean by this - FFFD doesnt need fixing - Windows just need to adopt some fonts with proper unicode support. we are dealing with their lack of doing that. > the .notdef glyph is not an appropriate indication of illegal encoding (like > broken UTF-8 bytes) true, but neither is U+2592. as far as i know U+2592 is not defined officially anywhere as being a representation of anything other than "MEDIUM SHADE". Corinna originally added it in 2009: http://cygwin.com/git/gitweb.cgi?p=newlib-cygwin.git&a=commitdiff&h=161211d with no justification of why it was chosen that i can tell. similarly, mintty actually changed from U+FFFD to U+2592 in 2009: http://github.com/mintty/mintty/commit/90c11d3 with actually a good reason, which was to avoid ambiguity with fonts that didnt have U+FFFD. but again, no reason why U+2592 was chosen. i personally see both sides of the argument but i tend to land of the side of any standards if they exist. Here is the standard for U+FFFD: http://unicode.org/charts/nameslist/n_FFF0.html > - revert to 2592: OK if we were to use something other than U+FFFD, I would propose U+25A1, as it is also defined by Unicode: 25A1 □ White Square • may be used to represent a missing ideograph http://unicode.org/charts/nameslist/n_25A0.html and it has better support than U+FFFD: yes: - Consolas - Courier New - DejaVu Sans Mono - MS Gothic - NSimSun no: - Lucida Console - SimSun-ExtB -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple