https://sourceware.org/bugzilla/show_bug.cgi?id=14094 --- Comment #27 from Mike FABIAN --- Created attachment 7931 --> https://sourceware.org/bugzilla/attachment.cgi?id=7931&action=edit gen-unicode-ctype.py Python rewrite of Bruno Haible’s gen-unicode-ctype.c. This version produces *exactly* the same output as the C program: $ gcc -o gen-unicode-ctype gen-unicode-ctype.c $ ./gen-unicode-ctype UnicodeData.txt 7.0.0 $ ./gen-unicode-ctype.py -u UnicodeData.txt -o unicode-new --unicode_version 7.0.0 $ diff -u unicode unicode-new $ -- You are receiving this mail because: You are on the CC list for the bug. >From glibc-bugs-return-26620-listarch-glibc-bugs=sources.redhat.com@sourceware.org Fri Nov 14 07:21:19 2014 Return-Path: Delivered-To: listarch-glibc-bugs@sources.redhat.com Received: (qmail 20129 invoked by alias); 14 Nov 2014 07:21:18 -0000 Mailing-List: contact glibc-bugs-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: glibc-bugs-owner@sourceware.org Delivered-To: mailing list glibc-bugs@sourceware.org Received: (qmail 20048 invoked by uid 48); 14 Nov 2014 07:21:14 -0000 From: "maiku.fabian at gmail dot com" To: glibc-bugs@sourceware.org Subject: [Bug localedata/14094] Update locale data to Unicode 7.0.0 Date: Fri, 14 Nov 2014 07:21:00 -0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: glibc X-Bugzilla-Component: localedata X-Bugzilla-Version: 2.21 X-Bugzilla-Keywords: X-Bugzilla-Severity: normal X-Bugzilla-Who: maiku.fabian at gmail dot com X-Bugzilla-Status: ASSIGNED X-Bugzilla-Priority: P2 X-Bugzilla-Assigned-To: pravin.d.s at gmail dot com X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: security- X-Bugzilla-Changed-Fields: attachments.created Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://sourceware.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-SW-Source: 2014-11/txt/msg00112.txt.bz2 Content-length: 956 https://sourceware.org/bugzilla/show_bug.cgi?id=14094 --- Comment #28 from Mike FABIAN --- Created attachment 7932 --> https://sourceware.org/bugzilla/attachment.cgi?id=7932&action=edit gen-unicode-ctype.py Improved version of gen-unicode-ctype.py which also parses DerivedCoreProperties.txt and uses it (partly) for is_alpha(), is_lower(), and is_upper(). "partly" because of 1): # Consider all the non-ASCII digits as alphabetic. # ISO C 99 forbids us to have them in category “digit”, # but we want iswalnum to return true on them. These digits are not “Alphabetic” in DerivedCoreProperties.txt but it seems to makes sense to treat them as alpha according to this comment by Bruno. and 2): title case characters are treated as both upper *and* lower. -- You are receiving this mail because: You are on the CC list for the bug. >From glibc-bugs-return-26621-listarch-glibc-bugs=sources.redhat.com@sourceware.org Fri Nov 14 07:24:07 2014 Return-Path: Delivered-To: listarch-glibc-bugs@sources.redhat.com Received: (qmail 21365 invoked by alias); 14 Nov 2014 07:24:06 -0000 Mailing-List: contact glibc-bugs-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: glibc-bugs-owner@sourceware.org Delivered-To: mailing list glibc-bugs@sourceware.org Received: (qmail 21278 invoked by uid 48); 14 Nov 2014 07:24:02 -0000 From: "maiku.fabian at gmail dot com" To: glibc-bugs@sourceware.org Subject: [Bug localedata/14094] Update locale data to Unicode 7.0.0 Date: Fri, 14 Nov 2014 07:24:00 -0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: glibc X-Bugzilla-Component: localedata X-Bugzilla-Version: 2.21 X-Bugzilla-Keywords: X-Bugzilla-Severity: normal X-Bugzilla-Who: maiku.fabian at gmail dot com X-Bugzilla-Status: ASSIGNED X-Bugzilla-Priority: P2 X-Bugzilla-Assigned-To: pravin.d.s at gmail dot com X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: security- X-Bugzilla-Changed-Fields: attachments.created Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 7bit X-Bugzilla-URL: http://sourceware.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-SW-Source: 2014-11/txt/msg00113.txt.bz2 Content-length: 356 https://sourceware.org/bugzilla/show_bug.cgi?id094 --- Comment #29 from Mike FABIAN --- Created attachment 7933 --> https://sourceware.org/bugzilla/attachment.cgi?idy33&actionit report-gen-unicode-ctype.py-DerivedCoreProperties-7.0.0 -- You are receiving this mail because: You are on the CC list for the bug.