From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 22299 invoked by alias); 13 Feb 2010 11:36:39 -0000 Received: (qmail 22288 invoked by uid 22791); 13 Feb 2010 11:36:38 -0000 X-SWARE-Spam-Status: No, hits=-6.6 required=5.0 tests=AWL,BAYES_00,RCVD_IN_DNSWL_HI,SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: sourceware.org Received: from mx1.redhat.com (HELO mx1.redhat.com) (209.132.183.28) by sourceware.org (qpsmtpd/0.43rc1) with ESMTP; Sat, 13 Feb 2010 11:36:34 +0000 Received: from int-mx08.intmail.prod.int.phx2.redhat.com (int-mx08.intmail.prod.int.phx2.redhat.com [10.5.11.21]) by mx1.redhat.com (8.13.8/8.13.8) with ESMTP id o1DBaWvm021324 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=OK); Sat, 13 Feb 2010 06:36:33 -0500 Received: from zebedee.pink (ovpn01.gateway.prod.ext.phx2.redhat.com [10.5.9.1]) by int-mx08.intmail.prod.int.phx2.redhat.com (8.13.8/8.13.8) with ESMTP id o1DBaVk9016338; Sat, 13 Feb 2010 06:36:32 -0500 Message-ID: <4B768EBE.4030200@redhat.com> Date: Sat, 13 Feb 2010 11:36:00 -0000 From: Andrew Haley User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.5) Gecko/20091209 Fedora/3.0-4.fc12 Thunderbird/3.0 MIME-Version: 1.0 To: java@gcc.gnu.org CC: classpath@gnu.org Subject: Re: [SAXParser] org.xml.sax.SAXParseException: not a name start character: "U+26" References: <4B768BDA.9050405@engelhart.org> In-Reply-To: <4B768BDA.9050405@engelhart.org> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-IsSubscribed: yes Mailing-List: contact java-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Id: List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: java-owner@gcc.gnu.org X-SW-Source: 2010-02/txt/msg00001.txt.bz2 On 02/13/2010 11:24 AM, Emmanuel Engelhart wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > Hi, > > not sure to be on the right ML to report this behavior, please help me > to find the right place to do it if not... and sorry for the noise. > > I use gcj on a LTS Ubuntu: > gcj (Ubuntu 4.4.1-5ubuntu2) 4.4.1 > > My sample code may be downloaded here as Test.java: > https://bugzilla.wikimedia.org/attachment.cgi?id=7115 > > and looks like that: > ================================ > import javax.xml.parsers.SAXParser; > import javax.xml.parsers.SAXParserFactory; > import org.xml.sax.helpers.DefaultHandler; > > public class Test { > > public static void main(String argv[]) { > > try { > > SAXParserFactory factory = SAXParserFactory.newInstance(); > SAXParser saxParser = factory.newSAXParser(); > > DefaultHandler handler = new DefaultHandler() {}; > saxParser.parse("test.xml", handler); > > } catch (Exception e) { > e.printStackTrace(); > } > } > } > ================================ > > I compile it like following: > gcj -o test --main=Test Test.java > > My XML file "test.xml" may be downloaded here: > https://bugzilla.wikimedia.org/attachment.cgi?id=7114 > > By running the the binary I get the following error: > $ ./test > org.xml.sax.SAXParseException: not a name start character: "U+26" > at gnu.xml.stream.SAXParser.parse(libgcj.so.10) > at javax.xml.parsers.SAXParser.parse(libgcj.so.10) > at javax.xml.parsers.SAXParser.parse(libgcj.so.10) > at Test.main(test) > Caused by: javax.xml.stream.XMLStreamException: not a name start > character: "U+26" > at gnu.xml.stream.XMLParser.error(libgcj.so.10) > at gnu.xml.stream.XMLParser.readNmtoken(libgcj.so.10) > at gnu.xml.stream.XMLParser.readNmtoken(libgcj.so.10) > at gnu.xml.stream.XMLParser.readCharData(libgcj.so.10) > at gnu.xml.stream.XMLParser.next(libgcj.so.10) > at gnu.xml.stream.SAXParser.parse(libgcj.so.10) > ...3 more > > Although the XML should be valid. > > Has someone an idea to explain this behavior? > > This "bug" impacts the Mediawiki mwdumper SW. T > The related bug (with more details) is available here: > https://bugzilla.wikimedia.org/show_bug.cgi?id=22137 Mmmm, looks like it probably is a real bug. If you remind me next week I'll have a look. Andrew.