bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] wget source code about -p option


From: Yiwei Yang
Subject: Re: [Bug-wget] wget source code about -p option
Date: Wed, 26 Jun 2013 13:27:38 -0500

Thanks for the detailed walk-through,  Angel. Could you please further
explain after I get all those tags and dispatch handlers for each of them
based on the tag, where does the handler implemented? Or where is the code
for actually retrieving the content for each url with a tag? Thank you!

Lucy


On Tue, Jun 25, 2013 at 6:13 PM, Ángel González <address@hidden> wrote:

> On 26/06/13 00:20, Yiwei Yang wrote:
>
>> Hi,
>>    I'm trying to understand how wget -p find out "everything that supports
>> the web page" to be downloaded . Could someone refer to me where I could
>> find this part of code in wget source code? Thank you!
>>
>> Lucy
>>
> See src/html-url.c
> The interesting tags are described at known_tags (line 91), whose
> attributes
> are listed at tag_url_attributes (line 140).
> The real work is done by src/html-parse.c, but you can treat it as a black
> box
> implementing map_html_tags().
> get_urls_html calls map_html_tags(), which then calls collect_tags_mapper()
> for each tag we marked.
>
> Regards
>
>


reply via email to

[Prev in Thread] Current Thread [Next in Thread]