Re: [Bug-wget] ot: extracting content from d/l pages

bug-wget

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] ot: extracting content from d/l pages

From:	Ángel González
Subject:	Re: [Bug-wget] ot: extracting content from d/l pages
Date:	Thu, 09 Jan 2014 00:09:21 +0100
User-agent:	Thunderbird

On 07/01/14 04:40, address@hidden wrote:

thanks for all the tips, snippets and encouragement!

I eventually suceeded in 'pushing buttons', and, 'got in', got my page

this is somewhat off topic, but, perhaps there is option that can help me ?

I have like:

wget -O page.html url
links -dump page.html>  page.txt

that worked well, till server got re-developed

when I run the script, page.html DOES contain desired data, BUT, NOT page.txt

looking at page.html it has like[1]:

readonly? is this some sort of attempt to prevent copying of data..?

thanks for any pointers

That's a html5 attribute
http://www.whatwg.org/specs/web-apps/current-work/multipage/common-input-element-attributes.html#the-readonly-attribute

It is not intended to avoid copying, just to make that you cannot changeit in the browser but still select, copy, etc. It shouldn't affect yourprocess (unless that attribute somehow confuses links program?)

[Prev in Thread]

Current Thread

[Next in Thread]

[Bug-wget] ot: extracting content from d/l pages, voytek, 2014/01/06
- Re: [Bug-wget] ot: extracting content from d/l pages, Ángel González <=
  - Re: [Bug-wget] ot: extracting content from d/l pages, voytek, 2014/01/08

Prev by Date: Re: [Bug-wget] wget alpha release 1.14.96-38327
Next by Date: Re: [Bug-wget] ot: extracting content from d/l pages
Previous by thread: [Bug-wget] ot: extracting content from d/l pages
Next by thread: Re: [Bug-wget] ot: extracting content from d/l pages
Index(es):
- Date
- Thread