[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Librefm-bugs] [bug #32936] lastscrape: BeautifulSoup fails to parse las
From: |
Petr Viktorin |
Subject: |
[Librefm-bugs] [bug #32936] lastscrape: BeautifulSoup fails to parse last.fm pages |
Date: |
Mon, 28 Mar 2011 13:57:27 +0000 |
User-agent: |
Mozilla/5.0 (X11; U; Linux x86_64; en-US) AppleWebKit/534.16 (KHTML, like Gecko) Ubuntu/10.10 Chromium/10.0.648.133 Chrome/10.0.648.133 Safari/534.16 |
Follow-up Comment #1, bug #32936 (project librefm):
The problem is caused by a weird comment: <!–[if IE]><![endif]–> (with
en-dashes instead of two hyphen-minuses) on line 9 of every last.fm page.
Removing this no-op comment makes everything work again.
I believe the attached patch is trivial enough so you can accept it without a
copyright assignment. (And if not, it's trivial to write from scratch.)
I filed a bug in BeautifulSoup at
https://bugs.launchpad.net/beautifulsoup/+bug/744278 – if that gets fixed no
workarounds are necessary.
Also, BeautifulSoup 3.2 works. Lastscrape's README should be updated
accordingly.
(file #23036)
_______________________________________________________
Additional Item Attachment:
File name: lastscrape.diff Size:1 KB
_______________________________________________________
Reply to this item at:
<http://savannah.gnu.org/bugs/?32936>
_______________________________________________
Message sent via/by Savannah
http://savannah.gnu.org/