bug-wget

Re: Links Not Parsing Correctly?


From: Derek Tombrello
Subject: Re: Links Not Parsing Correctly?
Date: Tue, 14 Nov 2023 12:22:40 -0600
User-agent: Mozilla Thunderbird

I appreciate that. I'll check that out. In the meantime, I came up with a bash fix for the files I've already downloaded. In case anyone else is interested or needs it, here are two simple commands, run in order in the same directory as the index.html files:

rename 's/index\.html\?page=([0-9]+)\&/index$1.html/' *
sed -Ei 's/index\.html\?page=([0-9]+)\&/index\1.html/g' *.html
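
For anyone whose system lacks the Perl version of rename (the util-linux rename takes different arguments) or whose sed does not accept -i without a suffix, the same two steps can be sketched in plain POSIX sh. This is only a rough sketch, not something from wget itself: fix_pages is a made-up helper name, and it assumes the saved files are named exactly 'index.html?page=N&' and that the links inside the pages contain that same literal text.

```shell
#!/bin/sh
# fix_pages DIR: hypothetical helper (not part of wget).
# Assumes files are named exactly 'index.html?page=N&' and that links
# inside the pages use that same literal string.
# The body runs in a subshell, so the cd and the loop variables do not
# leak into the caller.
fix_pages() (
    cd "$1" || exit 1

    # Step 1: rename 'index.html?page=N&' to 'indexN.html'.
    for f in index.html\?page=*\&; do
        [ -e "$f" ] || continue        # the glob matched nothing
        n=${f#'index.html?page='}      # strip the literal prefix
        n=${n%'&'}                     # strip the trailing '&'
        case $n in
            ''|*[!0-9]*) continue ;;   # skip names where N is not all digits
        esac
        mv -- "$f" "index$n.html"
    done

    # Step 2: rewrite every matching link in each .html file.
    # A temporary file is used instead of sed -i, which is not portable;
    # note that this may reset the file's permissions.
    for f in *.html; do
        [ -f "$f" ] || continue
        sed -E 's/index\.html\?page=([0-9]+)&/index\1.html/g' "$f" > "$f.tmp" &&
            mv -- "$f.tmp" "$f"
    done
)
```

Calling fix_pages . in the download directory should have the same effect as the two commands above. It is worth trying on a copy of the directory first.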




✞ Derek Tombrello (KM4JAG)
www.RobotsAndComputers.com


/"First they came for the Communists, but I was not a Communist so I did not speak out. Then they came for the Socialists and the Trade Unionists, but I was neither, so I did not speak out.
Then they came for the Jews, but I was not a Jew so I did not speak out.
And when they came for me, there was no one left to speak out for me."
/
------------------------------------------------------------------------

/"Every record has been destroyed or falsified, every book rewritten, every picture has been repainted, every statue and street building has been renamed, every date has been altered. And the process is continuing day by day and minute by minute. History has stopped. Nothing exists except an endless present in which the Party
is always right." - George Orwell, "1984" /


On 11/13/23 02:33, Stephane Ascoet wrote:
On 12/11/2023 at 18:00, bug-wget-request@gnu.org wrote:
From: Derek Tombrello <RebelTaz@RobotsAndComputers.com>
To: bug-wget@gnu.org
Subject: Links Not Parsing Correctly?

 From the main 'index.html' page, if you click on 'page 2', the address
bar reflects that it is displaying 'index.html?page=2&' but the actual
content is still that of the original 'index.html' page. I can double
click on the 'index.html?page=2&' file itself in the file manager and it
does, in fact, display the page associated with page 2.



Hi, I had almost exactly the same problem a few months ago and found no solution other than migrating to WebHTTrack. You can probably find the thread in the archives, beginning on 19/8/2023.

