Re: [Bug-wget] --spider output interpretation

bug-wget

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] --spider output interpretation

From:	Ángel González
Subject:	Re: [Bug-wget] --spider output interpretation
Date:	Sat, 22 Jun 2013 00:32:14 +0200
User-agent:	Thunderbird

On 21/06/13 17:47, andreas wpv wrote:

Can  you help me understand what this means? Just pulling the urls to build
a sitemap with it, but there is so much more in the output - and I cannot
find any documentation of the details in the output:

command:
wget --spider --recursive --no-verbose --no-parent -t 3
--output-file=wgetlog.txt $URL

example
2013-06-17 18:56:54 URL:http:/domain/file/19[149408/149408] ->
"domain/file/19.1" [1]


Bunch of questions:
1 what do numbers in brackets mean?

Page length. Downloaded 149408 out of 149408 bytes (if the server
hadn't provided a Content-Length, you would only have one number there).

2. why is there a second url?

domain/file/19.1 would be the file where it would store the web page
(actually, --spider is deleting the files after downloading, so you will
have a "domain" folder sitting there).

3. why is the second url modified with a '.1'  at the end?

There was a file called "domain/file/19", so it was downloaded to"domain/file/19.1"

4. what does the '[1]' at the end mean? Error code is kind of unlikely,
right?

That's the try count. It was downloaded at the first time.


Regards

[Prev in Thread]

Current Thread

[Next in Thread]

[Bug-wget] --spider output interpretation, andreas wpv, 2013/06/21
- Re: [Bug-wget] --spider output interpretation, Ángel González <=

Prev by Date: [Bug-wget] cmd_string_uppercase
Next by Date: Re: [Bug-wget] Problems with pod2man
Previous by thread: [Bug-wget] --spider output interpretation
Next by thread: [Bug-wget] cmd_string_uppercase
Index(es):
- Date
- Thread