
Re: [Bug-wget] ot: extracting content from d/l pages


From: Ángel González
Subject: Re: [Bug-wget] ot: extracting content from d/l pages
Date: Thu, 09 Jan 2014 00:09:21 +0100
User-agent: Thunderbird

On 07/01/14 04:40, address@hidden wrote:
thanks for all the tips, snippets and encouragement!

I eventually succeeded in 'pushing buttons', and 'got in', got my page

this is somewhat off topic, but perhaps there is an option that can help me?

I have like:

wget -O page.html url
links -dump page.html > page.txt

that worked well, till server got re-developed

when I run the script, page.html DOES contain desired data, BUT, NOT page.txt

looking at page.html it has like[1]:

readonly? is this some sort of attempt to prevent copying of data..?

thanks for any pointers
That's a standard HTML attribute, described in the HTML5 spec:
http://www.whatwg.org/specs/web-apps/current-work/multipage/common-input-element-attributes.html#the-readonly-attribute

It is not intended to prevent copying. It only stops you from changing the value in the browser; you can still select it, copy it, etc. It shouldn't affect your process (unless that attribute somehow confuses the links program?).
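
If the data now sits in the value attribute of read-only form fields, one possible workaround is to pull those values out of page.html directly instead of relying on the rendered dump. This is only a rough sketch under assumptions I cannot check from here: that each <input> tag fits on one line, that it carries both readonly and a double-quoted value="...", and that your grep supports -o. The function name extract_readonly_values is made up for illustration.

```shell
# Hypothetical helper: print the value of every <input ... readonly ...> tag
# found in the HTML file given as $1, one value per line.
# Assumes single-line tags and double-quoted value attributes.
extract_readonly_values() {
    # Isolate each input tag that mentions readonly ...
    grep -o '<input[^>]*readonly[^>]*>' "$1" |
        # ... then keep only the text inside value="..."
        sed -n 's/.*[[:space:]]value="\([^"]*\)".*/\1/p'
}
```

You would then call it in place of the links step, e.g.

extract_readonly_values page.html > page.txt

A regex like this is fragile against real-world HTML, so treat it as a quick check of where the data lives rather than a robust parser.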



