|
From: | Derek Tombrello |
Subject: | Links Not Parsing Correctly? |
Date: | Sat, 11 Nov 2023 17:23:26 -0600 |
User-agent: | Mozilla Thunderbird |
So, I'm trying to mirror a site. I'm using 'wget -r -l 0 -k www.site.com <http://www.site.com>' as the command. This works great... almost. The site is paginated in such a way that each successive page is linked using 'index.html?page=2&' where the number is incremented for each page. The index pages are being stored this way on my drive
|index.html index.html?page=2& index.html?page=3& index.html?page=4& ...etc... |
From the main 'index.html' page, if you click on 'page 2', the address bar reflects that it is displaying 'index.html?page=2&' but the actual content is still that of the original 'index.html' page. I can double click on the 'index.html?page=2&' file itself in the file manager and it does, in fact, display the page associated with page 2.
What I am trying to figure out is, is there any EASY way to get the page links to work from within the web page. Or am I going to have to manually rename the 'index.html?page=2&' files and edit the html files to reflect the new names? That's really more than I want to have to do.
Or... is there anything I can do to the command parameters that would correct this behaviour?
I hope all of this makes sense. It does in my head, but... it's cluttered up there....
-- ✞ Derek Tombrello (KM4JAG) www.RobotsAndComputers.com
[Prev in Thread] | Current Thread | [Next in Thread] |