bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] recursive_retrieve()


From: Dale R. Worley
Subject: Re: [Bug-wget] recursive_retrieve()
Date: Mon, 06 Mar 2017 11:59:43 -0500

Tim Rühsen <address@hidden> writes:
> Did you try wildcard matching ? (-A "*.pdf*")

That's a bit subtle, though.  The -A pattern apparently has to match
everything in the URL after the final /, *including* the query-part
("?..."), which strictly speaking isn't part of the file name.  But the
documentation of -A/-R (in 1.16) describes it as a pattern to match the
file name:

       -A acclist --accept acclist
       -R rejlist --reject rejlist
           Specify comma-separated lists of file name suffixes or patterns to
           accept or reject. Note that if any of the wildcard characters, *,
           ?, [ or ], appear in an element of acclist or rejlist, it will be
           treated as a pattern, rather than a suffix.  In this case, you have
           to enclose the pattern into quotes to prevent your shell from
           expanding it, like in -A "*.mp3" or -A '*.mp3'.

Then again, what does -A/-R match against in a URL
"http://example.com/file.pdf?a/b/c";?

It seems like this needs something like "Note that this is matched
against the entire part of the URL following the final slash; for URLs
containing queries, it may not be the final component of the path part
of the URL."

Dale



reply via email to

[Prev in Thread] Current Thread [Next in Thread]