[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [SAXParser] org.xml.sax.SAXParseException: not a name start characte
From: |
Chris Burdess |
Subject: |
Re: [SAXParser] org.xml.sax.SAXParseException: not a name start character: "U+26" |
Date: |
Mon, 22 Feb 2010 10:49:00 +0000 |
Andrew Haley wrote:
> On 02/13/2010 11:24 AM, Emmanuel Engelhart wrote:
>> -----BEGIN PGP SIGNED MESSAGE-----
>> Hash: SHA1
>>
>> Hi,
>>
>> not sure to be on the right ML to report this behavior, please help me
>> to find the right place to do it if not... and sorry for the noise.
>>
>> I use gcj on a LTS Ubuntu:
>> gcj (Ubuntu 4.4.1-5ubuntu2) 4.4.1
>>
>> My sample code may be downloaded here as Test.java:
>> https://bugzilla.wikimedia.org/attachment.cgi?id=7115
>>
>> and looks like that:
>> ================================
>> import javax.xml.parsers.SAXParser;
>> import javax.xml.parsers.SAXParserFactory;
>> import org.xml.sax.helpers.DefaultHandler;
>>
>> public class Test {
>>
>> public static void main(String argv[]) {
>>
>> try {
>>
>> SAXParserFactory factory = SAXParserFactory.newInstance();
>> SAXParser saxParser = factory.newSAXParser();
>>
>> DefaultHandler handler = new DefaultHandler() {};
>> saxParser.parse("test.xml", handler);
>>
>> } catch (Exception e) {
>> e.printStackTrace();
>> }
>> }
>> }
>> ================================
>>
>> I compile it like following:
>> gcj -o test --main=Test Test.java
>>
>> My XML file "test.xml" may be downloaded here:
>> https://bugzilla.wikimedia.org/attachment.cgi?id=7114
>>
>> By running the the binary I get the following error:
>> $ ./test
>> org.xml.sax.SAXParseException: not a name start character: "U+26"
>> at gnu.xml.stream.SAXParser.parse(libgcj.so.10)
>> at javax.xml.parsers.SAXParser.parse(libgcj.so.10)
>> at javax.xml.parsers.SAXParser.parse(libgcj.so.10)
>> at Test.main(test)
>> Caused by: javax.xml.stream.XMLStreamException: not a name start
>> character: "U+26"
>> at gnu.xml.stream.XMLParser.error(libgcj.so.10)
>> at gnu.xml.stream.XMLParser.readNmtoken(libgcj.so.10)
>> at gnu.xml.stream.XMLParser.readNmtoken(libgcj.so.10)
>> at gnu.xml.stream.XMLParser.readCharData(libgcj.so.10)
>> at gnu.xml.stream.XMLParser.next(libgcj.so.10)
>> at gnu.xml.stream.SAXParser.parse(libgcj.so.10)
>> ...3 more
>>
>> Although the XML should be valid.
>>
>> Has someone an idea to explain this behavior?
>>
>> This "bug" impacts the Mediawiki mwdumper SW. T
>> The related bug (with more details) is available here:
>> https://bugzilla.wikimedia.org/show_bug.cgi?id=22137
>
> Mmmm, looks like it probably is a real bug. If you remind me next
> week I'll have a look.
I reproduced and fixed this bug, see the attached patch. If someone could
validate and commit it for me please do so, I don't have a complete working
build environment at the moment.
--
Chris Burdess
patch
Description: Binary data