LYNX-DEV Re: new Lynx SGML.c parser
From: Klaus Weide
Subject: LYNX-DEV Re: new Lynx SGML.c parser
Date: Wed, 23 Apr 1997 14:43:57 -0500 (CDT)
On Wed, 23 Apr 1997, Christopher R. Maden wrote:
> [Klaus Weide reports on a more SGML-ish HTML parser.]
>
> Klaus, and anyone else:
>
> One of the reasons I'm so excited about XML is that it's a chance for
> the Web community to make a fresh start and recover from all the crap
> out there that calls itself HTML. Lynx (and other browsers) can
> continue to do what they do now for HTML, but do things the Right Way
> for XML.
Well, I was thinking of you when I started this "new parser"[*] project.
I remember you claimed that a structured parser with error recovery
heuristics could improve the handling of invalid markup (or words to that
effect; I hope I didn't get your meaning too wrong). So there is now a way
to test that claim... This is of course not real SGML parsing,
just an attempt to resemble it a bit better. (Not that I really understand
all the things a real SGML parser is supposed to do...)
[*] It is also not really a "new" parser, just the old one with some
exceptions taken out, and with some (crude) heuristics and some more
per-element information added in. All changes concern only the
content models and nesting. The added "DTD" information is
hardwired and looks like this:
#define T_ABBREV 0x0002,0x8B04F,0x8FFFF,0xA778F,0xF7FBF,0x00003,0x00000
which is rather unreadable, but fits the info on one line per element :)
and there are still some unused bits left in that...
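
To illustrate how a one-line macro like that can carry content-model
information, here is a minimal sketch in C. It is not the Lynx code: the
message does not say what the seven values mean, so the slot names
(F_FLAGS, F_OWN_CLASS, F_CONTAINS), the ElementInfo type, the
may_contain() helper and the two DEMO elements with their values are all
hypothetical. Only the T_ABBREV line is taken from the text above.

```c
#include <assert.h>
#include <stddef.h>

/* The one line quoted in the message. */
#define T_ABBREV 0x0002,0x8B04F,0x8FFFF,0xA778F,0xF7FBF,0x00003,0x00000

/* Made-up values for two illustrative elements (NOT from Lynx).
 * Class bit 0x1 stands for "inline", class bit 0x2 for "block". */
#define T_DEMO_PARA   0x0001,0x00002,0x00001,0x00000,0x00000,0x00000,0x00000
#define T_DEMO_INLINE 0x0002,0x00001,0x00001,0x00000,0x00000,0x00000,0x00000

/* Assumed meaning of the first three slots; the other four stay opaque. */
enum { F_FLAGS, F_OWN_CLASS, F_CONTAINS, F_COUNT = 7 };

typedef struct {
    const char *name;
    unsigned long info[F_COUNT];  /* the seven values from the macro */
} ElementInfo;

/* A macro that expands to a comma-separated list can fill an array
 * initializer directly, which keeps the table at one line per element. */
static const ElementInfo demo_elements[] = {
    { "ABBREV",      { T_ABBREV } },
    { "DEMO-PARA",   { T_DEMO_PARA } },
    { "DEMO-INLINE", { T_DEMO_INLINE } },
};

/* Nesting check under the assumed layout: the child is allowed if it
 * belongs to at least one class that the parent accepts as content. */
static int may_contain(const ElementInfo *parent, const ElementInfo *child)
{
    assert(parent != NULL && child != NULL);
    return (parent->info[F_CONTAINS] & child->info[F_OWN_CLASS]) != 0;
}
```

With these made-up values, may_contain(&demo_elements[1], &demo_elements[2])
is true (inline content inside the paragraph), while the reverse nesting is
rejected. The check is deliberately not applied to the ABBREV entry, since
the real meaning of its fields is unknown here.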
> I would be *very* interested in beginning work on a new internal MIME
> type for XML handling in Lynx. I need some assistance, though, with
> the HTTP side of things. Would anyone who's interested in beginning
> this project please drop me a line off-list?
Why not keep it on the list? At least I would be interested to hear
what you have in mind.
Klaus