Subject: Re: locale encodings
From: Steven Abner
Date: Tue, 12 Nov 2013 14:52:00 -0000
To: Keld Simonsen
Cc: Troy Korjuslommi, libc-locales@sourceware.org

On 12 Nov 2013, at 8:36 AM, Keld
Simonsen wrote:

> On Tue, Nov 12, 2013 at 12:37:53AM -0500, Carlos O'Donell wrote:
>> On 11/11/2013 08:22 PM, Keld Simonsen wrote:
>>> Well, the encoding of the source code of all locales should be 7-bit ascii, for
>>> maximum portability. Then the target encoding should be recorded via the
>>> % charset specification, which gives a list of possible charsets, comma separated.
>>> UTF-8 should always be included there, but other encodings should also be available.
>>
>> So one of the points that we've been trying to gather consensus on is:
>> Is it really important to have 7-bit ASCII? Why not use UTF-8 for the
>> locale source? It's readily readable by all editors and allows
>> language specific comments in the source files for maximum maintenance.
>
> I think to have UTF-8 is a bad idea, eg for embedded systems, and for systems that are
> not maintained in UTF-8. It also can give trouble when communicating the source.

FWIW, all the data that is important, save one, is in POSIX's 7-bit ASCII. The
ones I've examined and patched seem to be an almost identical copy of Section 7
of The Open Group Base Specifications.
There are some that have minor data problems, but what I was trying to access was
the default character set. That happens to be in the "comments" section for some
reason.

Steve