From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 58386 invoked by alias); 24 Jul 2017 13:10:06 -0000 Mailing-List: contact libc-alpha-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: libc-alpha-owner@sourceware.org Received: (qmail 58264 invoked by uid 89); 24 Jul 2017 13:10:05 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-0.9 required=5.0 tests=AC_HTML_NONSENSE_TAGS,BAYES_00,KAM_LAZY_DOMAIN_SECURITY,RP_MATCHES_RCVD,SPF_HELO_PASS autolearn=no version=3.3.2 spammy=frana, fran, Frana, HContent-Transfer-Encoding:8bit X-HELO: mx1.redhat.com DMARC-Filter: OpenDMARC Filter v1.3.2 mx1.redhat.com ED7237F3E3 Authentication-Results: ext-mx01.extmail.prod.ext.phx2.redhat.com; dmarc=none (p=none dis=none) header.from=redhat.com Authentication-Results: ext-mx01.extmail.prod.ext.phx2.redhat.com; spf=pass smtp.mailfrom=mfabian@redhat.com DKIM-Filter: OpenDKIM Filter v2.11.0 mx1.redhat.com ED7237F3E3 From: Mike FABIAN To: libc-alpha@sourceware.org Subject: Is it OK to write ASCII strings directly into locale source files? Date: Mon, 24 Jul 2017 13:13:00 -0000 Message-ID: User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.1.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-SW-Source: 2017-07/txt/msg00808.txt.bz2 Currently the locale source files use a lot of code points even for strings which are pure ASCII. For example localedata/locales/de_DE contains: % "%a %d %b %Y %T %Z" d_t_fmt "" Would it be OK to write this as d_t_fmt "%a %d %b %Y %T %Z" ?? This would make the files much more readable. Stuff that is mostly ASCII can probably be written like this: % https://oc.wikipedia.org/wiki/Fran%C3%A7a França country_name "Frana" which is already more readable then writing it all in code points. It would be even nicer to write it completely in UTF-8, i.e.: country_name "França" but I am not sure whether this is allowed in the locale source files. But at least for everything which is ASCII, it might be OK already to write the characters directly. Is writing ASCII there allowed or not?? -- Mike FABIAN