[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[cp-patches] FYI: Implementation of the HTML 4.01 parsing into org.w3c.d
From: |
Meskauskas Audrius |
Subject: |
[cp-patches] FYI: Implementation of the HTML 4.01 parsing into org.w3c.dom.html2.HTMLDocument. |
Date: |
Sat, 19 Mar 2005 23:24:45 +0100 |
After Chris has implemented the Java binding for Level 2 Document Object
Model HTML, it was very simple to direct our parser output into these
classes. Chris did a real job: the document model classes find the requested
properties self dependently. For instance, I did not need to set the FORM
for each INPUT explicitly as the existing getForm() method finds it itself
between the parent nodes.
The DOM model may be very convenient for web robots, because it can be
analysed using transforms.
As this is a new, undocumented feature, I suggest to include an example as
well.
Audrius
DomHTMLParser.java
Description: Binary data
parse_into_dom_html2.java
Description: Binary data
[Prev in Thread] |
Current Thread |
[Next in Thread] |
- [cp-patches] FYI: Implementation of the HTML 4.01 parsing into org.w3c.dom.html2.HTMLDocument.,
Meskauskas Audrius <=