wget-dev



From: Mitar (@mitar)
Subject: wget2 | Force directories for every HTML URL without file extension (#628)
Date: Sat, 29 Apr 2023 14:25:40 +0000


Mitar created an issue: https://gitlab.com/gnuwget/wget2/-/issues/628



I was looking into wget/wget2 for archiving a website to GitHub Pages, but the 
issue is that wget breaks URLs, so external links to my website would break 
if I just mirrored the website and replaced it with GitHub Pages.

I think I want an alternative to `--adjust-extension` which would save every 
downloaded HTML URL without a file extension into its own directory as 
`index.html`. I would want the `/foo/bar` URL to be downloaded into 
`/foo/bar/index.html`, so that when you then open `/foo/bar` in a browser, 
standard HTTP servers (and GitHub Pages) redirect to `/foo/bar/index.html`.

So with this there would be an extra redirect to the new location, but all 
existing inbound links from other sites to the archived website would continue 
to work.
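
To make the requested mapping concrete, here is a minimal Python sketch of the 
rule described above. It is not wget2 code and does not correspond to any 
existing wget/wget2 option; the function name `local_path_for` and its 
`content_type` parameter are hypothetical, and the extension check is 
deliberately crude (a last path segment such as `v1.2` would be treated as 
having an extension).

```python
from pathlib import PurePosixPath
from urllib.parse import urlsplit


def local_path_for(url: str, content_type: str = "text/html") -> str:
    """Hypothetical illustration of the proposed URL-to-file mapping.

    HTML responses whose URL path has no file extension are stored as
    <path>/index.html; everything else keeps its original path.
    """
    # An empty path (e.g. "https://example.com") is treated as "/".
    path = urlsplit(url).path or "/"

    # Directory-style URLs are saved as index.html inside that directory.
    if path.endswith("/"):
        return path + "index.html"

    # Ignore parameters such as "; charset=utf-8" when checking the type.
    is_html = content_type.split(";")[0].strip().lower() == "text/html"

    # Crude extension test: any suffix on the last segment counts.
    has_extension = PurePosixPath(path).suffix != ""

    if is_html and not has_extension:
        return path + "/index.html"
    return path


if __name__ == "__main__":
    for url, ctype in [
        ("https://example.com/foo/bar", "text/html; charset=utf-8"),
        ("https://example.com/foo/page.html", "text/html"),
        ("https://example.com/foo/style.css", "text/css"),
    ]:
        print(url, "->", local_path_for(url, ctype))
```

With this rule, `/foo/bar` ends up at `/foo/bar/index.html`, while URLs that 
already carry an extension, and non-HTML resources, are left where they are.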




