classpath-patches



From: Meskauskas Audrius
Subject: [cp-patches] FYI: Implementation of the HTML 4.01 parsing into org.w3c.dom.html2.HTMLDocument.
Date: Sat, 19 Mar 2005 23:24:45 +0100

Now that Chris has implemented the Java binding for the Level 2 Document Object Model HTML, it was very simple to direct our parser output into these classes. Chris did a great job: the document model classes resolve the requested properties on their own. For instance, I did not need to set the FORM for each INPUT explicitly, because the existing getForm() method finds it among the parent nodes.
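To illustrate the idea of finding the FORM among the parent nodes, here is a minimal sketch using only the standard org.w3c.dom and JAXP classes. It is not the actual getForm() code: the class FindEnclosingForm and the helpers findForm() and sample() are hypothetical names, and a small well-formed XML string stands in for the output of the HTML parser.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

class FindEnclosingForm {

    // Hypothetical helper: walk up the parent chain until a FORM element is met.
    // Returns null when the node is not nested inside any form.
    static Element findForm(Node node) {
        for (Node p = node.getParentNode(); p != null; p = p.getParentNode()) {
            if (p.getNodeType() == Node.ELEMENT_NODE
                    && "form".equalsIgnoreCase(p.getNodeName())) {
                return (Element) p;
            }
        }
        return null;
    }

    // Assumption: a well-formed sample document replaces real parser output.
    static Document sample() throws Exception {
        String xml = "<html><body>"
                + "<form id=\"login\"><p><input name=\"user\"/></p></form>"
                + "<input name=\"orphan\"/>"
                + "</body></html>";
        return DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));
    }

    public static void main(String[] args) throws Exception {
        NodeList inputs = sample().getElementsByTagName("input");
        for (int i = 0; i < inputs.getLength(); i++) {
            Element input = (Element) inputs.item(i);
            Element form = findForm(input);
            System.out.println(input.getAttribute("name") + " -> "
                    + (form == null ? "no form" : form.getAttribute("id")));
        }
    }
}
```

Because the lookup is done on demand, the parser only has to build the tree; no FORM reference needs to be stored on each INPUT.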

The DOM may be very convenient for web robots, because the resulting document can be analysed using transforms.
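As a rough sketch of what analysing a DOM with a transform could look like for a robot, the following applies a small XSLT stylesheet to a DOM tree and collects the link targets. It uses only the standard javax.xml.transform API; the class LinkExtractor, its methods, the stylesheet and the example.org URLs are all made up for illustration, and a well-formed XML string again stands in for a document produced by the HTML parser.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

class LinkExtractor {

    // Hypothetical stylesheet: prints the href of every anchor, one per line.
    static final String XSLT =
          "<xsl:stylesheet version=\"1.0\""
        + " xmlns:xsl=\"http://www.w3.org/1999/XSL/Transform\">"
        + "<xsl:output method=\"text\"/>"
        + "<xsl:template match=\"/\">"
        + "<xsl:for-each select=\"//a[@href]\">"
        + "<xsl:value-of select=\"@href\"/><xsl:text>&#10;</xsl:text>"
        + "</xsl:for-each>"
        + "</xsl:template>"
        + "</xsl:stylesheet>";

    // Assumption: a well-formed sample document replaces real parser output.
    static Document sample() throws Exception {
        String xml = "<html><body>"
                + "<a href=\"http://example.org/a\">A</a>"
                + "<p><a href=\"http://example.org/b\">B</a></p>"
                + "<a name=\"anchor-only\">no href</a>"
                + "</body></html>";
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true); // XSLT processors expect namespace-aware DOMs
        return factory.newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));
    }

    // Runs the stylesheet directly over the DOM tree and returns the text output.
    static String extractLinks(Document doc) throws Exception {
        Transformer transformer = TransformerFactory.newInstance()
                .newTransformer(new StreamSource(new StringReader(XSLT)));
        StringWriter out = new StringWriter();
        transformer.transform(new DOMSource(doc), new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        System.out.print(extractLinks(sample()));
    }
}
```

The transform works on the tree rather than on the markup text, so the same stylesheet could be reused for any document that ends up in a DOM.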

As this is a new, undocumented feature, I suggest including an example as well.

Audrius

Attachment: DomHTMLParser.java
Description: Binary data

Attachment: parse_into_dom_html2.java
Description: Binary data

