bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Bug-wget] How to prevent .1.html numbering of downloaded file?


From: B Wooster
Subject: [Bug-wget] How to prevent .1.html numbering of downloaded file?
Date: Fri, 28 Nov 2014 14:11:00 -0500

This only happens to some of my downloads - presumably there was a conflict
that caused it to name something .1.html? But I can't see any reason for it
in the log file.

Example of downloaded files:
albums/
albums.1.html
article/
article.html
band/
band.html
blog/
blog.1.html
etc

I don't see any mention of a albums.html in the log, just the albums.1.html

This was done for a fresh wget download, nothing in target directory.
wget --recursive --page-requisites --timestamping --level=9
--exclude-directories=/cgi-bin,/files,/fonts --adjust-extension
--execute=robots=off --convert-links -P tmp.wget
'--reject-regex=(.*/email.html)' -o log1 http://www.example.com/

I'm trying to make a local archive of a local Drupal site, and can deal
with the .html suffix, but cannot handle a .1 or .2 etc suffix... for now,
am just trying to understand why it added .1 to some files above and not
all.

It seems running it a bunch of times gets different files with the number -
sometimes I do get blog.html instead of blog.1.html (but that may be due to
other reasons, downloading a partial site.)


reply via email to

[Prev in Thread] Current Thread [Next in Thread]