bug-wget
From: Ángel González
Subject: [Bug-wget] Exposing wget functionality for extracting links from a web page
Date: Mon, 16 Sep 2013 01:56:03 +0200
User-agent: Thunderbird

There have been several requests to reuse the wget HTML parser for finding links or page requisites. See the "wget questions" thread from August, or the "wget source code about -p option" thread from June.

These two patches provide a new binary that does just that.
The main drawback of the current code is that it is not yet hooked into the autoconf-generated Makefile. Maybe someone here who is fluent in autoconf can help with that bit?
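
For whoever picks that up, here is a rough, untested sketch of what the hook might look like in src/Makefile.am. It assumes that bin_PROGRAMS is already defined there, and that linking against ../lib/libgnu.a can replace the individual ../lib/*.c files from the manual command; neither assumption has been checked against the tree.

```
# Hypothetical addition to src/Makefile.am (untested sketch).
# Automake canonicalizes the program name "get-urls" to "get_urls"
# in the per-program variable names.
bin_PROGRAMS += get-urls

get_urls_SOURCES = get-urls.c html-url.c html-parse.c hash.c log.c \
                   url.c css_.c css-url.c iri.c utils.c exits.c host.c

# Assumption: gnulib objects come from libgnu.a instead of being
# compiled one by one.
get_urls_LDADD = ../lib/libgnu.a -lidn -lpcre
```

The source list is the same set of src/ files used in the manual gcc command; only the way the gnulib pieces are linked differs.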

Meanwhile, you can compile it with:
gcc -o get-urls get-urls.c html-url.c html-parse.c hash.c log.c url.c \
    ../lib/c-ctype.c ../lib/quotearg.c ../lib/c-strcasecmp.c \
    ../lib/xalloc-die.c ../lib/xmalloc.c css_.c css-url.c iri.c utils.c \
    ../lib/regex.c ../lib/exitfail.c exits.c host.c ../lib/localcharset.c \
    ../lib/ioctl.c -I../lib -I. -lidn -lpcre

(You will need to build wget first, because some of those files are generated during the build.)

Regards

Attachment: 0001-Moved-free_urlpos.patch
Description: Text Data

Attachment: 0002-Expose-wget-functionality-for-extracting-links-from-.patch
Description: Text Data

