bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Links Not Parsing Correctly?


From: Derek Tombrello
Subject: Links Not Parsing Correctly?
Date: Sat, 11 Nov 2023 17:23:26 -0600
User-agent: Mozilla Thunderbird

I don't know if this is the right place to ask this, but I can't find anywhere else to turn, so....

So, I'm trying to mirror a site. I'm using 'wget -r -l 0 -k www.site.com <http://www.site.com>' as the command. This works great... almost. The site is paginated in such a way that each successive page is linked using 'index.html?page=2&' where the number is incremented for each page. The index pages are being stored this way on my drive

|index.html index.html?page=2& index.html?page=3& index.html?page=4& ...etc... |

From the main 'index.html' page, if you click on 'page 2', the address bar reflects that it is displaying 'index.html?page=2&' but the actual content is still that of the original 'index.html' page. I can double click on the 'index.html?page=2&' file itself in the file manager and it does, in fact, display the page associated with page 2.

What I am trying to figure out is, is there any EASY way to get the page links to work from within the web page. Or am I going to have to manually rename the 'index.html?page=2&' files and edit the html files to reflect the new names? That's really more than I want to have to do.

Or... is there anything I can do to the command parameters that would correct this behaviour?

I hope all of this makes sense. It does in my head, but... it's cluttered up there....

--



✞ Derek Tombrello (KM4JAG)
www.RobotsAndComputers.com



reply via email to

[Prev in Thread] Current Thread [Next in Thread]