From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 31154 invoked by alias); 3 Jan 2009 15:13:46 -0000 Received: (qmail 28861 invoked by uid 48); 3 Jan 2009 15:08:19 -0000 Date: Sat, 03 Jan 2009 15:13:00 -0000 Message-ID: <20090103150819.28860.qmail@sourceware.org> From: "bonzini at gnu dot org" To: glibc-bugs-regex@sources.redhat.com In-Reply-To: <20081230174726.9697.bonzini@gnu.org> References: <20081230174726.9697.bonzini@gnu.org> Reply-To: sourceware-bugzilla@sourceware.org Subject: [Bug regex/9697] character does not match neither [a-z] nor [^a-z] X-Bugzilla-Reason: CC Mailing-List: contact glibc-bugs-regex-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Subscribe: List-Post: List-Help: , Sender: glibc-bugs-regex-owner@sourceware.org X-SW-Source: 2009-01/txt/msg00000.txt.bz2 ------- Additional Comments From bonzini at gnu dot org 2009-01-03 15:08 ------- Created an attachment (id=3634) --> (http://sourceware.org/bugzilla/attachment.cgi?id=3634&action=view) working patch Here is the logic I used: - for [^...], [[:class:]], [[=elem=] try at every position that starts a multibyte character. The correct singlebyte character positions are chosen by the corresponding SIMPLE_BRACKET. - for [[.elem.]] and ranges, try at every position that might start a multibyte collation element. Again, singlebyte collation elements are taken care of by SIMPLE_BRACKETs. - unless the second bullet is used, of course, multibyte characters must be added separately to the fastmap. Tested on i686-pc-linux-gnu together with the other patch I sent on 2008-12-31, and on which this depends. Ok? -- What |Removed |Added ---------------------------------------------------------------------------- Attachment #3629 is|0 |1 obsolete| | AssignedTo|drepper at redhat dot com |bonzini at gnu dot org Status|NEW |ASSIGNED http://sourceware.org/bugzilla/show_bug.cgi?id=9697 ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.