[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Bug-wget] What ought to be a simple use of wget
From: |
Dale R. Worley |
Subject: |
Re: [Bug-wget] What ought to be a simple use of wget |
Date: |
Fri, 05 Aug 2016 10:47:22 -0400 |
I've tried some further experiments. One thing I realized was that the
"protocol" directory contains only two files, one of which I wanted, so
I could get very close to the ideal with
$ wget -r --include-directories=/assignments,/protocols
http://www.iana.org/protocols/index.html
Unfortunately, the web site is constructed to confound that, because
protocols/index.html redirects *also* -- to
http://www.iana.org/protocols! (Despite that that's the name of a
directory also.) There's no way to retrieve the root HTML file without
wget considering its "directory" to be "http://www.iana.org/".
So there's no nice solution without either revising the web site or
changing wget's behavior.
Dale
- Re: [Bug-wget] What ought to be a simple use of wget, (continued)