[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Bug-wget] Re: Only follow paths with /res/ in them
From: |
Micah Cowan |
Subject: |
[Bug-wget] Re: Only follow paths with /res/ in them |
Date: |
Wed, 19 Nov 2008 20:57:33 -0800 |
User-agent: |
Thunderbird 2.0.0.17 (X11/20080925) |
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Oh! Please don't use this list (address@hidden) any more; I'm trying to
get the dotsrc folks to make it go away/forward to bug-wget (I need to
ping 'em on this again). The official list for Wget is now address@hidden
Micah Cowan wrote:
> Brian wrote:
>> I would like to follow all the urls on a site that contain /res/ in the
>> path. I've tried using -I and -A, with values such as res, *res*,
>> */res/*, etc.. Here is an example that downloads pretty much the entire
>> site, rather than what I appear (to me) to have specified:
>
>> wget -O- -q http://img.site.org/b/imgboard.html | wget -q -r -l1 -O- -I
>> '*res*' -A '*res*' --force-html -B http://img.site.org/b/ -i-
>
>> The urls I would like to follow and output to the command line are of
>> the form:
>
>> http://img.site.org/b/res/97867797.html
>
> -A isn't useful here: it's applied only against the "filename" portion
> of the URL.
>
> -I is what you want; the trouble is that the * wildcard doesn't match
> slashes (there's plans to introduce a ** wildcard, probably in 1.13). So
> unfortunately you gotta do -I'res,*/res,*/*/res' etc as needed.
>
- --
Micah J. Cowan
Programmer, musician, typesetting enthusiast, gamer.
GNU Maintainer: wget, screen, teseq
http://micah.cowan.name/
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
iEYEARECAAYFAkkk7j0ACgkQ7M8hyUobTrH+CACbBzcO4vM6qHIumBeDS2ZyAdfq
ONYAnjX7SHAOvEJylkbjjq7IsDXEv+27
=3Hrq
-----END PGP SIGNATURE-----
[Prev in Thread] |
Current Thread |
[Next in Thread] |
- [Bug-wget] Re: Only follow paths with /res/ in them,
Micah Cowan <=