From: Thomas Dickey
To: cygwin-xfree@cygwin.com
Date: Fri, 04 Dec 2009 09:45:00 -0000
Subject: Re: X11R7.5 and C.UTF-8
Message-ID: <20091204044346.M57150@mail101.his.com>

On Thu, 3 Dec 2009, Eric Blake wrote:

> Thomas Dickey writes:
>
>>> This means that characters 0..127 have to be treated as ASCII, but
>
> No, it means that portable characters and control characters must be < 128.
> ASCII meets this characteristic, but so does EBCDIC, as well as UTF-8. The C
> locale also implies that you can manipulate bytes >= 128 in the naive manner,
> so long as you don't care about characters embedded in those bytes. And what
> do you know - ASCII, EBCDIC, and UTF-8 all meet this property, too.
>
>>> beyond that an implementation can do what it wants. And on Cygwin 1.7,
>>> plain "C" actually does imply UTF-8, which happily is
>>> backward-compatible with ASCII.
>>
>> That's an interpretation that so far hasn't been blessed by the standards
>> people. Any discussion of this topic should mention that, as a caveat.
>
> Actually, the standards people HAVE spoken - and they agreed with our
> interpretation. POSIX was INTENTIONALLY written with the intent that a UTF-8
> encoding is valid for the C locale, for the same reason that it was written
> that an EBCDIC encoding is valid for the C locale. These emails from the
> Austin Group (the folks that write POSIX) are telling:
>
> https://www.opengroup.org/sophocles/show_mail.tpl?CALLER=show_archive.tpl&source=L&listname=austin-group-l&id=12982

This is basically your email on the matter.

> https://www.opengroup.org/sophocles/show_mail.tpl?CALLER=show_archive.tpl&source=L&listname=austin-group-l&id=13012
>
> But they also admitted that there is still more work needed in POSIX to make
> this intent clearly codified (for example, that control characters must be
> single bytes < 128).

But they have not actually agreed with you yet.

-- 
Thomas E. Dickey
http://invisible-island.net
ftp://invisible-island.net