From mboxrd@z Thu Jan 1 00:00:00 1970
Received: (qmail 18356 invoked by alias); 13 Feb 2010 11:24:14 -0000
Received: (qmail 18346 invoked by uid 22791); 13 Feb 2010 11:24:13 -0000
X-SWARE-Spam-Status: No, hits=-2.6 required=5.0 tests=BAYES_00
X-Spam-Check-By: sourceware.org
Received: from relay2-d.mail.gandi.net (HELO relay2-d.mail.gandi.net) (217.70.183.194) by sourceware.org (qpsmtpd/0.43rc1) with ESMTP; Sat, 13 Feb 2010 11:24:08 +0000
Received: from [192.168.10.106] (196-109.62-188.cust.bluewin.ch [188.62.109.196]) by relay2-d.mail.gandi.net (Postfix) with ESMTPA id C8355225191; Sat, 13 Feb 2010 12:24:01 +0100 (CET)
Message-ID: <4B768BDA.9050405@engelhart.org>
Date: Sat, 13 Feb 2010 11:24:00 -0000
From: Emmanuel Engelhart
Reply-To: emmanuel@engelhart.org
User-Agent: Thunderbird 2.0.0.23 (X11/20090817)
MIME-Version: 1.0
To: java@gcc.gnu.org
Subject: [SAXParser] org.xml.sax.SAXParseException: not a name start character: "U+26"
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit
Mailing-List: contact java-help@gcc.gnu.org; run by ezmlm
Precedence: bulk
Sender: java-owner@gcc.gnu.org
X-SW-Source: 2010-02/txt/msg00000.txt.bz2

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi,

I am not sure this is the right mailing list for reporting this
behavior. If it is not, please point me to the right place, and sorry
for the noise.

I use gcj on an LTS Ubuntu:

  gcj (Ubuntu 4.4.1-5ubuntu2) 4.4.1

My sample code can be downloaded here as Test.java:
https://bugzilla.wikimedia.org/attachment.cgi?id=7115

and looks like this:

================================
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.helpers.DefaultHandler;

public class Test {
    public static void main(String[] argv) {
        try {
            // Obtain the default SAX parser implementation.
            SAXParserFactory factory = SAXParserFactory.newInstance();
            SAXParser saxParser = factory.newSAXParser();

            // A handler that ignores every event; only parsing is exercised.
            DefaultHandler handler = new DefaultHandler() {};

            saxParser.parse("test.xml", handler);
        } catch (Exception e) {
            e.printStackTrace();
        }
    }
}
================================

I compile it as follows:

  gcj -o test --main=Test Test.java

My XML file "test.xml" can be downloaded here:
https://bugzilla.wikimedia.org/attachment.cgi?id=7114

When I run the binary, I get the following error:

$ ./test
org.xml.sax.SAXParseException: not a name start character: "U+26"
   at gnu.xml.stream.SAXParser.parse(libgcj.so.10)
   at javax.xml.parsers.SAXParser.parse(libgcj.so.10)
   at javax.xml.parsers.SAXParser.parse(libgcj.so.10)
   at Test.main(test)
Caused by: javax.xml.stream.XMLStreamException: not a name start character: "U+26"
   at gnu.xml.stream.XMLParser.error(libgcj.so.10)
   at gnu.xml.stream.XMLParser.readNmtoken(libgcj.so.10)
   at gnu.xml.stream.XMLParser.readNmtoken(libgcj.so.10)
   at gnu.xml.stream.XMLParser.readCharData(libgcj.so.10)
   at gnu.xml.stream.XMLParser.next(libgcj.so.10)
   at gnu.xml.stream.SAXParser.parse(libgcj.so.10)
   ...3 more

This happens although the XML should be valid. Does anyone have an idea
what could explain this behavior? This "bug" affects the MediaWiki
mwdumper software.

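The message above only says that the parser rejected U+26 (the ampersand, "&") and not where in test.xml it stopped. A minimal sketch of how the position could be narrowed down, using the standard getLineNumber() and getColumnNumber() accessors of org.xml.sax.SAXParseException, might look like the following. The class name ParseLocator and its helper methods are hypothetical and not part of Test.java, and I have not checked whether the gcj parser actually fills in the position; both accessors return -1 when it is unknown.

```java
import java.io.File;
import java.io.StringReader;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXParseException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical diagnostic helper, not part of the original Test.java.
class ParseLocator {

    // Parses the source and returns "ok", or the position and message
    // that the parser attached to its SAXParseException.
    static String locate(InputSource source) {
        try {
            SAXParser parser = SAXParserFactory.newInstance().newSAXParser();
            parser.parse(source, new DefaultHandler());
            return "ok";
        } catch (SAXParseException e) {
            // Line and column are -1 if the parser does not know them.
            return "line " + e.getLineNumber()
                    + ", column " + e.getColumnNumber()
                    + ": " + e.getMessage();
        } catch (Exception e) {
            // Configuration or I/O problems are not parse positions.
            throw new IllegalStateException(e);
        }
    }

    // Convenience wrapper for trying small XML snippets held in memory.
    static String locateText(String xml) {
        return locate(new InputSource(new StringReader(xml)));
    }

    public static void main(String[] argv) {
        // Defaults to the file name used in the report above.
        String path = argv.length > 0 ? argv[0] : "test.xml";
        String systemId = new File(path).toURI().toString();
        System.out.println(locate(new InputSource(systemId)));
    }
}
```

Running this against test.xml should print either "ok" or a line and column to look at, which would make it easier to compare the behavior of the gcj parser with that of another Java runtime on the same input.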
The related bug (with more details) is available here:
https://bugzilla.wikimedia.org/show_bug.cgi?id=22137

Regards
Emmanuel
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iEYEARECAAYFAkt2i9gACgkQn3IpJRpNWtPy7ACeMKYcXxFh5l1T28KCA2uen5Qs
DOwAoJsTg6aHlEQWFJX1yFR29IJx50/i
=HxoD
-----END PGP SIGNATURE-----