bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] How do I tell wget not to follow links in a file?


From: Giuseppe Scrivano
Subject: Re: [Bug-wget] How do I tell wget not to follow links in a file?
Date: Thu, 07 Apr 2011 14:26:57 +0200
User-agent: Gnus/5.13 (Gnus v5.13) Emacs/24.0.50 (gnu/linux)

"David Skalinder" <address@hidden> writes:

>> I want to mirror part of a website that contains two links pages, each of
>> which contains links to many root-level directories and also to the other
>> links page.  I want to download recursively all the links from one links
>> page, but not from the other: that is, I want to tell wget "download
>> links1 and follow all of its links, but do not download or follow links
>> from links2".
>>
>> I've put a demo of this problem up at http://fangjaw.com/wgettest -- there
>> is a diagram there that might state the problem more clearly.
>>
>> This functionality seems so basic that I assume I must be overlooking
>> something.  Clearly wget has been designed to give users control over
>> which files they download; but all I can find is that -X controls both
>> saving and link-following at the directory level, while -R controls saving
>> at the file level but still follows links from unsaved files.

why doesn't -X work in the scenario you have described?  If all links
from `links2' are under /B, you can exclude them using something like:

wget -r -Xwgettest/B http://fangjaw.com/wgettest

Cheers,
Giuseppe



reply via email to

[Prev in Thread] Current Thread [Next in Thread]