bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Trying to mirror some blogs before destruction this month's 21


From: Stephane Ascoet
Subject: Re: Trying to mirror some blogs before destruction this month's 21
Date: Wed, 30 Aug 2023 12:18:07 +0200
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:45.0) Gecko/20100101 Thunderbird/45.8.0

Le 20/08/2023 à 14:22, Tim Rühsen a écrit :
Hi,

which version of Wget are you using ? (wget --version)

Hi, I've seen your mail only today, it's 1.21


Wget only replaces links of successfully downloaded pages.
Can you give 1-2 examples of links that haven't been converted?
Can you also send your /tmp/speak as it might contain information that
helps debugging?

The platform is now supposed to be offline, so we can't pursue tests. We could try from the wayback machine but this one causes others problems because of the way it rewrites URIs(I sent a request to them about this a long time ago). Anyway, I succeed with HTTrack...


When testing, the website pretty quickly seem to block my IP.
So I can not really reproduce anything.

I guess they had overload with hundreds of people sucking Websites the last day... I had shortages too while downloading...


These combinations of options are long-used by me and happen to work,
even if I already had to correct links manually(thanks Sed!)

What do you think has changed? Did you update Wget and this may be a
regression?

It was on another Websites, with 1.18



--
Regards, Stephane Ascoet




reply via email to

[Prev in Thread] Current Thread [Next in Thread]