From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 16313 invoked by alias); 14 Nov 2014 07:15:55 -0000 Mailing-List: contact glibc-bugs-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: glibc-bugs-owner@sourceware.org Received: (qmail 16218 invoked by uid 48); 14 Nov 2014 07:15:49 -0000 From: "maiku.fabian at gmail dot com" To: glibc-bugs@sourceware.org Subject: [Bug localedata/14094] Update locale data to Unicode 7.0.0 Date: Fri, 14 Nov 2014 07:15:00 -0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: glibc X-Bugzilla-Component: localedata X-Bugzilla-Version: 2.21 X-Bugzilla-Keywords: X-Bugzilla-Severity: normal X-Bugzilla-Who: maiku.fabian at gmail dot com X-Bugzilla-Status: ASSIGNED X-Bugzilla-Priority: P2 X-Bugzilla-Assigned-To: pravin.d.s at gmail dot com X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: security- X-Bugzilla-Changed-Fields: attachments.created Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://sourceware.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-SW-Source: 2014-11/txt/msg00111.txt.bz2 https://sourceware.org/bugzilla/show_bug.cgi?id=3D14094 --- Comment #27 from Mike FABIAN --- Created attachment 7931 --> https://sourceware.org/bugzilla/attachment.cgi?id=3D7931&action=3Dedit gen-unicode-ctype.py Python rewrite of Bruno Haible=E2=80=99s gen-unicode-ctype.c. This version produces *exactly* the same output as the C program: $ gcc -o gen-unicode-ctype gen-unicode-ctype.c $ ./gen-unicode-ctype UnicodeData.txt 7.0.0 $ ./gen-unicode-ctype.py -u UnicodeData.txt -o unicode-new --unicode_version 7.0.0 $ diff -u unicode unicode-new $ --=20 You are receiving this mail because: You are on the CC list for the bug. >>From glibc-bugs-return-26620-listarch-glibc-bugs=sources.redhat.com@sourceware.org Fri Nov 14 07:21:19 2014 Return-Path: Delivered-To: listarch-glibc-bugs@sources.redhat.com Received: (qmail 20129 invoked by alias); 14 Nov 2014 07:21:18 -0000 Mailing-List: contact glibc-bugs-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: glibc-bugs-owner@sourceware.org Delivered-To: mailing list glibc-bugs@sourceware.org Received: (qmail 20048 invoked by uid 48); 14 Nov 2014 07:21:14 -0000 From: "maiku.fabian at gmail dot com" To: glibc-bugs@sourceware.org Subject: [Bug localedata/14094] Update locale data to Unicode 7.0.0 Date: Fri, 14 Nov 2014 07:21:00 -0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: glibc X-Bugzilla-Component: localedata X-Bugzilla-Version: 2.21 X-Bugzilla-Keywords: X-Bugzilla-Severity: normal X-Bugzilla-Who: maiku.fabian at gmail dot com X-Bugzilla-Status: ASSIGNED X-Bugzilla-Priority: P2 X-Bugzilla-Assigned-To: pravin.d.s at gmail dot com X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: security- X-Bugzilla-Changed-Fields: attachments.created Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://sourceware.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-SW-Source: 2014-11/txt/msg00112.txt.bz2 Content-length: 956 https://sourceware.org/bugzilla/show_bug.cgi?id=3D14094 --- Comment #28 from Mike FABIAN --- Created attachment 7932 --> https://sourceware.org/bugzilla/attachment.cgi?id=3D7932&action=3Dedit gen-unicode-ctype.py Improved version of gen-unicode-ctype.py which also parses DerivedCoreProperties.txt and uses it (partly) for is_alpha(), is_lower(), and is_upper(). "partly" because of 1): # Consider all the non-ASCII digits as alphabetic. # ISO C 99 forbids us to have them in category =E2=80=9Cdigit= =E2=80=9D, # but we want iswalnum to return true on them. These digits are not =E2=80=9CAlphabetic=E2=80=9D in DerivedCoreProperties.= txt but it seems to makes sense to treat them as alpha according to this comment by Bruno. and 2): title case characters are treated as both upper *and* lower. --=20 You are receiving this mail because: You are on the CC list for the bug. >>From glibc-bugs-return-26621-listarch-glibc-bugs=sources.redhat.com@sourceware.org Fri Nov 14 07:24:07 2014 Return-Path: Delivered-To: listarch-glibc-bugs@sources.redhat.com Received: (qmail 21365 invoked by alias); 14 Nov 2014 07:24:06 -0000 Mailing-List: contact glibc-bugs-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: glibc-bugs-owner@sourceware.org Delivered-To: mailing list glibc-bugs@sourceware.org Received: (qmail 21278 invoked by uid 48); 14 Nov 2014 07:24:02 -0000 From: "maiku.fabian at gmail dot com" To: glibc-bugs@sourceware.org Subject: [Bug localedata/14094] Update locale data to Unicode 7.0.0 Date: Fri, 14 Nov 2014 07:24:00 -0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: glibc X-Bugzilla-Component: localedata X-Bugzilla-Version: 2.21 X-Bugzilla-Keywords: X-Bugzilla-Severity: normal X-Bugzilla-Who: maiku.fabian at gmail dot com X-Bugzilla-Status: ASSIGNED X-Bugzilla-Priority: P2 X-Bugzilla-Assigned-To: pravin.d.s at gmail dot com X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: security- X-Bugzilla-Changed-Fields: attachments.created Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 7bit X-Bugzilla-URL: http://sourceware.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-SW-Source: 2014-11/txt/msg00113.txt.bz2 Content-length: 356 https://sourceware.org/bugzilla/show_bug.cgi?id=14094 --- Comment #29 from Mike FABIAN --- Created attachment 7933 --> https://sourceware.org/bugzilla/attachment.cgi?id=7933&action=edit report-gen-unicode-ctype.py-DerivedCoreProperties-7.0.0 -- You are receiving this mail because: You are on the CC list for the bug.