From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 117276 invoked by alias); 4 Sep 2018 23:29:02 -0000 Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner@cygwin.com Mail-Followup-To: cygwin@cygwin.com Received: (qmail 117267 invoked by uid 89); 4 Sep 2018 23:29:01 -0000 Authentication-Results: sourceware.org; auth=none X-Spam-SWARE-Status: No, score=-1.9 required=5.0 tests=AWL,BAYES_00,FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS autolearn=ham version=3.3.2 spammy=recommendations, H*M:1c69fb81, WHITE, H*M:google X-HELO: mail-oi0-f67.google.com Received: from mail-oi0-f67.google.com (HELO mail-oi0-f67.google.com) (209.85.218.67) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Tue, 04 Sep 2018 23:29:00 +0000 Received: by mail-oi0-f67.google.com with SMTP id y207-v6so10076555oie.13 for ; Tue, 04 Sep 2018 16:29:00 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=message-id:date:from:subject:to:references:user-agent; bh=mdkZsGgqrZHz5GO6+C4ZvoNzPCM6gvuhSnzmLYH1NQ0=; b=TLPJsuJQQMfImRn3l2kJd85kvTop7FfelKXIrMyg/xAyCB8a7hCQK45drn7pEsr7TM XroKxE5FX+dWedd/iUIoQ+vFtb+iHy+r/aJtm2968MVjokxLtlIGSAb/s/PAd5hoq80f HzaTEKwDj3yHSL/hNx5aAMIyY76pBfyAVUuzRoYjWul867NRe+5F9M5g2yxFIH2avZFE kfVJeb4ic0IF5JoKqmgXTI8n20u1o/vtXjrpNSBALZFAsUOi1cqpUyLKcsjNuMYDWZPb kWXmJHXsbJXIYlKUurSqi65LuekKd+hMWi3h4ExAB05ubAJwnybgZaeMIX7GbMepS/77 8OUg== Return-Path: Received: from cXi (cpe-76-182-193-237.tx.res.rr.com. [76.182.193.237]) by smtp.gmail.com with ESMTPSA id p132-v6sm433862oia.31.2018.09.04.16.28.57 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 04 Sep 2018 16:28:57 -0700 (PDT) Message-ID: <5b8f1539.1c69fb81.3d7fb.2575@mx.google.com> Date: Tue, 04 Sep 2018 23:29:00 -0000 From: Steven Penny Subject: Re: Cygwin fails to utilize Unicode replacement character To: cygwin@cygwin.com References: <5c366e53-ad20-7ccc-5d76-c4fd5adefdf9@towo.net> Content-Type: text/plain; charset=utf8; format=flowed User-Agent: Tryst/2.8.0 (cup.github.io/tryst) X-SW-Source: 2018-09/txt/msg00082.txt.bz2 On Tue, 4 Sep 2018 23:43:16, Thomas Wolff wrote: > Traditionally, many terminals used to display the DEL character as a > checkered block, which is more or less the MEDIUM SHADE. > This makes the glyph appear somewhat "erroneous" by convention. I see - now that Unicode has some dedicated characters for this, it would make sense to use them, especially since linux is already using them: 1. U+FFFD: http://unicode.org/charts/nameslist/n_FFF0.html 2. U+25A1: http://unicode.org/charts/nameslist/n_25A0.html > valid code point with no glyph in font -> .notdef glyph -> WHITE SQUARE this is not true. "WHITE SQUARE" refers to U+25A1, which is an actual character and different from the ".notdef" glyph. as has been discussed as length in this thread, the ".notdef glyph" is not an actual character, but a glyph that exists at position 0 in the font, and while its appearance is not strictly defined, some recommendations exist: - empty rectangle - rectangle with a question mark - rectangle with an X > Now if you switch to FFFD REPLACEMENT CHARACTER for invalid code point, > and considering that it does not exist in most actual fonts and that the > console does not apply font fallback, it will resolve to WHITE SQUARE, thus: > folding the two different use cases into the same appearance, > which is bad. no again, it will resolve to ".notdef glyph", as I put above. otherwise yes, you do have a point. in the case of a font without U+FFFD, you have ultimately: invalid code point: .notdef glyph missing character: .notdef glyph several ideas have been proposed: 1. keep U+FFFD 2. go back to U+2592 3. use U+25A1 instead 4. use U+FFFD if possible else fallback to U+2592 or U+25A1 if we choose option 1, people not happy with the ambiguity can simply install "dejavu-fonts" or similar, which Cygwin provides. -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple