[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Bug-wget] --spider output interpretation
From: |
Ángel González |
Subject: |
Re: [Bug-wget] --spider output interpretation |
Date: |
Sat, 22 Jun 2013 00:32:14 +0200 |
User-agent: |
Thunderbird |
On 21/06/13 17:47, andreas wpv wrote:
Can you help me understand what this means? Just pulling the urls to build
a sitemap with it, but there is so much more in the output - and I cannot
find any documentation of the details in the output:
command:
wget --spider --recursive --no-verbose --no-parent -t 3
--output-file=wgetlog.txt $URL
example
2013-06-17 18:56:54 URL:http:/domain/file/19[149408/149408] ->
"domain/file/19.1" [1]
Bunch of questions:
1 what do numbers in brackets mean?
Page length. Downloaded 149408 out of 149408 bytes (if the server
hadn't provided a Content-Length, you would only have one number there).
2. why is there a second url?
domain/file/19.1 would be the file where it would store the web page
(actually, --spider is deleting the files after downloading, so you will
have a "domain" folder sitting there).
3. why is the second url modified with a '.1' at the end?
There was a file called "domain/file/19", so it was downloaded to
"domain/file/19.1"
4. what does the '[1]' at the end mean? Error code is kind of unlikely,
right?
That's the try count. It was downloaded at the first time.
Regards