bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Recursive downloading of pages through the "action" attributes of th


From: Tim Rühsen
Subject: Re: Recursive downloading of pages through the "action" attributes of the following "form" tags
Date: Sat, 22 Apr 2023 20:21:29 +0200
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.7.2

On 17.04.23 21:22, BERBAR Florian wrote:
Hi folk,

I have question about recursive downloading of webpages. Trying to download all pages from a website using recursing option (--recursive) on wget 1.21, the webpages processing seems to don't follow form "action" attributs of "form" tags.

- Does it be the expecting behavior?
- Is there a combination of options to download all pages of a website with the attribut "action"?


Exemple with 3 HTML pages :

- Page 1 - form.html : HTML form with "action" attribut pointing to "Page 2"
- Page 2 - post.html : HTML page with a link to "Page 3".
- Page 3 - link.html : HTML page without link.

I tried this command to download all tree pages but only "Page 1" was downloaded:

$ wget -r https://host/form.html


I tried "--follow-tags=form" option but the same behavior was observed.

Generally, Wget supports form tags with action attributes.
So maybe you encounter malformed HTML or there is a bug in Wget.

Could you please give us a copy of that page, or at least the HTML part containing the form tags ?

Regards, Tim



Regards,

Florian

Attachment: OpenPGP_signature
Description: OpenPGP digital signature


reply via email to

[Prev in Thread] Current Thread [Next in Thread]