On 17.04.23 21:22, BERBAR Florian wrote:
Hi folk,
I have question about recursive downloading of webpages. Trying to
download all pages from a website using recursing option
(--recursive) on wget 1.21, the webpages processing seems to don't
follow form "action" attributs of "form" tags.
- Does it be the expecting behavior?
- Is there a combination of options to download all pages of a
website with the attribut "action"?
Exemple with 3 HTML pages :
- Page 1 - form.html : HTML form with "action" attribut pointing to
"Page 2"
- Page 2 - post.html : HTML page with a link to "Page 3".
- Page 3 - link.html : HTML page without link.
I tried this command to download all tree pages but only "Page 1" was
downloaded:
$ wget -r https://host/form.html
I tried "--follow-tags=form" option but the same behavior was observed.
Generally, Wget supports form tags with action attributes.
So maybe you encounter malformed HTML or there is a bug in Wget.
Could you please give us a copy of that page, or at least the HTML
part containing the form tags ?
Regards, Tim
Regards,
Florian