bug-wget
Re: [Bug-wget] download that require login/password form submit


From: Keisial
Subject: Re: [Bug-wget] download that require login/password form submit
Date: Fri, 09 Apr 2010 17:14:48 +0200
User-agent: Thunderbird

Voytek Eymont wrote:
> Micah,
>
> thanks !!!!!!
> I'm logging in OK.
>
> on next step I do like:
>
> wget --load-cookies=my-cookies.txt --save-cookies=my-cookies.txt
> --keep-session-cookies
> http://www.domain.tld/main.htm?_template=advanced&_module=active_list
>
> that fails until I put "" around the http string like so:
>
> wget --load-cookies=my-cookies.txt --save-cookies=my-cookies.txt
> --keep-session-cookies
> "http://www.domain.tld/main.htm?_template=advanced&_module=active_list"
>
> or should I use some '%' characters ? for & ? or just " " around https
> string ?
>   

Just surround it with double " " or single ' ' quotes.
If the & is not quoted, your shell treats it as a command separator: it runs
wget in the background with the URL cut off at the &, and then assigns
active_list to a shell variable called _module. (If there wasn't a = it would
try to run a program called _module, which would give you an error message
you could notice.)
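
A small sketch of the difference, assuming a POSIX-style shell (sh or bash) with default globbing settings. show_args is a made-up helper that stands in for wget and only prints the arguments it receives, so nothing is downloaded:

```shell
#!/bin/sh
# show_args is a hypothetical stand-in for wget: it prints each argument in brackets.
show_args() { printf '[%s]\n' "$@"; }

unset _module

# Unquoted: the shell ends the first command at '&' and runs it in the background,
# then treats the rest of the line as a variable assignment.
show_args http://www.domain.tld/main.htm?_template=advanced&_module=active_list
wait    # let the background job finish printing
echo "_module is now: $_module"

# Quoted: the whole URL reaches the command as a single argument.
quoted=$(show_args 'http://www.domain.tld/main.htm?_template=advanced&_module=active_list')
echo "$quoted"
```

The unquoted call only sees the URL up to _template=advanced, and _module ends up set in the shell; the quoted call sees the full URL.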
 

> next question: the resulting file has lots and lots of bumpf like
> space.gif galore, etc,
>
> how do I make into text as much as possible, is there a wget function, or ?
>   
Remove anything between < and >, then unescape the entities. That should
give you quite clean text with a minimal effort.
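
A rough sketch of that with sed, under some assumptions: strip_html is a made-up function name (it is not a wget option), it works line by line so a tag that spans several lines is not removed, and it only unescapes a handful of common entities:

```shell
#!/bin/sh
# strip_html is a hypothetical helper, not part of wget.
# It reads HTML on stdin and writes roughly plain text on stdout.
strip_html() {
    # 1. drop everything between < and > (per line only)
    # 2. unescape a few common entities; &amp; goes last so that
    #    something like "&amp;lt;" is not unescaped twice
    sed -e 's/<[^>]*>//g' \
        -e 's/&nbsp;/ /g' \
        -e 's/&lt;/</g' \
        -e 's/&gt;/>/g' \
        -e 's/&quot;/"/g' \
        -e 's/&amp;/\&/g'
}

# Made-up sample line in the style of the page described above.
sample='<td><img src="space.gif"><b>Active &amp; pending</b> &lt;list&gt;</td>'
text=$(printf '%s\n' "$sample" | strip_html)
printf '%s\n' "$text"
```

On a saved page this could be used as strip_html < main.htm > main.txt, where both file names are placeholders.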
