bug-gnupod
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Bug-gnupod] Podcast character data truncated and ' not escaped in XML g


From: H. Langos
Subject: [Bug-gnupod] Podcast character data truncated and ' not escaped in XML generation
Date: Fri, 11 Apr 2008 12:50:54 +0200
User-agent: Mutt/1.5.13 (2006-08-11)

Hi Adrian,

Podcast titles that contain escaped xml characters (like &) often get
truncated at that character ... The reason is that the parser doesn't
have to return all the character data at once.

See: http://perl-xml.sourceforge.net/faq/#char_events

So a podcast containing <title>The new Q&amp;A Show</title> can result in
"The new Q"

I guess the same error occurs when you read the GNUtunesDB.xml to create
the itunes.db

Also I noticed that in XMLhelper::xescaped you seem to have forgotten to
escape the ' character (single quote or apostrophe).

The later bug probably rarely caused problems because the first bug
already truncated cdata at that point.

If you need a podcast to test it on, I'd recommend this one:

http://www.thenakedscientists.com/naked_scientists_podcast.xml

It contains XML entities in the titles and lots of apostrophs in the
descriptions. (thanx David for pointing me there)

Cheers
-henrik





reply via email to

[Prev in Thread] Current Thread [Next in Thread]