bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] How to ignore errors with time stamping


From: Andre Majorel
Subject: Re: [Bug-wget] How to ignore errors with time stamping
Date: Thu, 11 Dec 2008 17:25:31 +0100
User-agent: Mutt/1.5.17+20080114 (2008-01-14)

On 2008-12-11 09:17 +0100, Morten Lemvigh wrote:

> I'm having a problem retrieving a page, when I use the time
> stamping option.
>
> When I run wget with:
> wget -N 'http://eur-lex.europa.eu/JOHtml.do?uri=OJ:C:2007:306:SOM:EN:HTML'
>
> the file is downloaded, but I get the message:
> "Last-modified header missing -- time-stamps turned off."
>
> If I run the command a second time, I get an "ERROR 500: Internal Server  
> Error." and wget exits. If I leave the time stamping option out, the  
> document is retrieved again.
>
> Is there a way to make wget ignore missing Last-modified headers, and  
> just retrieve the document?

I believe it's what it does by default. Wget only checks for the
Last-modified header here because you told it to (-N).

> When mirroring a site wget will stop and not  follow any links
> on a page, which doesn't send a Last-modified header.

Do you have a log showing that behaviour ? Recursive retrieval of
sites that don't return Last-modified works for me.

-- 
André Majorel <URL:http://www.teaser.fr/~amajorel/>




reply via email to

[Prev in Thread] Current Thread [Next in Thread]