From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 10237 invoked by alias); 23 May 2011 14:51:25 -0000 Received: (qmail 10227 invoked by uid 22791); 23 May 2011 14:51:24 -0000 X-SWARE-Spam-Status: No, hits=-2.2 required=5.0 tests=AWL,BAYES_00,DKIM_SIGNED,DKIM_VALID,RCVD_IN_DNSWL_LOW X-Spam-Check-By: sourceware.org Received: from out2.smtp.messagingengine.com (HELO out2.smtp.messagingengine.com) (66.111.4.26) by sourceware.org (qpsmtpd/0.43rc1) with ESMTP; Mon, 23 May 2011 14:51:05 +0000 Received: from compute3.internal (compute3.nyi.mail.srv.osa [10.202.2.43]) by gateway1.messagingengine.com (Postfix) with ESMTP id D6B2C202AB for ; Mon, 23 May 2011 10:51:04 -0400 (EDT) Received: from frontend1.messagingengine.com ([10.202.2.160]) by compute3.internal (MEProxy); Mon, 23 May 2011 10:51:04 -0400 Received: from [158.147.67.90] (158-147-67-90.harris.com [158.147.67.90]) by mail.messagingengine.com (Postfix) with ESMTPSA id 8C11D400AED; Mon, 23 May 2011 10:51:04 -0400 (EDT) Message-ID: <4DDA7457.5030801@cwilson.fastmail.fm> Date: Mon, 23 May 2011 14:51:00 -0000 From: Charles Wilson Reply-To: Charles Wilson User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.2; en-US; rv:1.9.2.17) Gecko/20110414 Thunderbird/3.1.10 MIME-Version: 1.0 To: cygwin-talk@cygwin.com Subject: Re: internationalized text processing References: <2BF01EB27B56CC478AD6E5A0A28931F202910C7C@A1DAL1SWPES19MB.ams.acs-inc.net> In-Reply-To: <2BF01EB27B56CC478AD6E5A0A28931F202910C7C@A1DAL1SWPES19MB.ams.acs-inc.net> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Mailing-List: contact cygwin-talk-help@cygwin.com; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: cygwin-talk-owner@cygwin.com Reply-To: The Vulgar and Unprofessional Cygwin-Talk List Mail-Followup-To: cygwin-talk@cygwin.com X-SW-Source: 2011-q2/txt/msg00011.txt.bz2 On 5/23/2011 9:44 AM, Nellis, Kenneth wrote: > OTOH, the cygutils "ascii -e" utility does not > recognize that my LANG specifies UTF-8 and outputs garbage for the > extended half. Should this be considered a bug? No, ascii is deliberately intended to be stupid. In fact, the -e option itself is simply a workaround. Originally, ascii always printed the first 256 codepoints unconditionally. With the change in 1.7 to defaulting to UTF-8, we recognized that this was bad, and changed ascii's default behavior to print only the first 128 codepoints, and added -e to restore the original behavior. A bit of history: the reason ascii was originally written was as a diagnostic tool so that folks could check whether their font settings and (old, obsolete, do not do this anymore: CYGWIN var contained charset:oem) var settings were correct, so that the DOS line-drawing characters could be used in a bash shell (running in a cmd box). E.g. poor man's hack to get CP437 working This predated "real" codepage and $LANG handling, in cygwin-1.7. Now, with "real" $LANG handling, line draw stuff Just Works(tm) when LANG=*.UTF-8, at least for ncurses programs: try /usr/lib/ncurses/test/lrtest.exe from the ncurses-demo package, in a bash shell running in a cmd box (or pstree -G). So, the need for ascii (-e) as a diagnostic tool is kinda...not needed anymore. Thus, I'm not too fussed about this "bug" in an obsolete and no longer needed diag tool -- but I also don't see a need to remove it from cygutils. So...mark this "bug" as either NOTABUG or WONTFIX. :-) With regards to your other questions...I dunno. Maybe somebody else does. -- Chuck