[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Lynx-dev] retrieving text from html5 page?
From: |
fa-ml |
Subject: |
Re: [Lynx-dev] retrieving text from html5 page? |
Date: |
Thu, 9 Jan 2014 12:41:25 +0100 |
User-agent: |
Mutt/1.5.21 (2010-09-15) |
On Thu, Jan 09, 2014 at 10:52:57AM +1100, address@hidden wrote:
> now they use html5 stuff, and, page.html has data I want, but, page.txt
> only has 'labels' but not data contents, andy thought how I can do
> that...?
Not a solution but a possibly a workaround
> <label class="pfbc-label">Postcode</label><input type="text"
> name="SYS_Addresses_e_address_i_0_e_postcode_tx" value="2000"
> readonly="readonly" class="ro pfbc-textbox"/>
If data is as structured as the one you pasted, it would make sense to use
a parsing library to extract the needed info from the XML.
It would require a bit more effort, but probably be safer than handling
freeform text.