From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 9770 invoked by alias); 25 Jan 2005 10:32:04 -0000 Mailing-List: contact glibc-bugs-regex-help@sources.redhat.com; run by ezmlm Precedence: bulk List-Subscribe: List-Post: List-Help: , Sender: glibc-bugs-regex-owner@sources.redhat.com Received: (qmail 9608 invoked by uid 48); 25 Jan 2005 10:31:47 -0000 Date: Tue, 25 Jan 2005 10:32:00 -0000 From: "kasal at ucw dot cz" To: glibc-bugs-regex@sources.redhat.com Message-ID: <20050125103144.693.kasal@ucw.cz> Reply-To: sourceware-bugzilla@sources.redhat.com Subject: [Bug regex/693] New: mishandled '\B' X-Bugzilla-Reason: CC X-SW-Source: 2005-01/txt/msg00001.txt.bz2 List-Id: The GNU extension '\B' has always meant non-\b. The dfa.[ch] code included in grep and gawk still handles it this way. Try echo ' ' | grep ' \B ' on your system, try gsub(/ \B /,...) in gawk-3.1.1 or gawk '/ B /' with current gawk. I have checked Perl documentation; it also defines '\B' and non-\b. But current regex has changed to interpret '\B' as inword space. Try the above gsub with current awk. See also http://lists.gnu.org/archive/html/bug-gnu-utils/2005-01/msg00087.html IMHO, the current regex code is not correct. -- Summary: mishandled '\B' Product: glibc Version: unspecified Status: NEW Severity: normal Priority: P2 Component: regex AssignedTo: gotom at debian dot or dot jp ReportedBy: kasal at ucw dot cz CC: glibc-bugs-regex at sources dot redhat dot com,glibc- bugs at sources dot redhat dot com http://sources.redhat.com/bugzilla/show_bug.cgi?id=693 ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.