lynx-dev
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: lynx-dev Lynx character entity references fix


From: Leonid Pauzner
Subject: Re: lynx-dev Lynx character entity references fix
Date: Sun, 14 Mar 1999 23:33:31 +0300 (MSK)

12-Mar-99 22:55 I wrote:
>>> There there two more charsets not shown above: iso-8859-1 and us-ascii
>>> (before iso-8859-15) - apparently constant slot #0.

>> I'm mot sure why us-ascii doesn't show up in the TRACE - possibly
>> because 8-bit characters get rejected already in SGML.c, so it never comes
>> this far.  (speculation...)

> No. This is a bug in SGML.c: us-ascii document assumed as iso-8859-1 document
> so 8bit chars not filtered out but "translated from iso-8859-1" !

> Try raw8bit.html under the test/ directory: set "assumed charset" to us-ascii
> and "display charset" to something not latin1, say us-ascii,
> and try to switch "\" several times - no problem with HTPlain.c

Writing from memory I was wrong: both text/html and text/plain
"translate" upper half us-ascii instead of filtering them out
(but 7F translated differently, restricted 128-lowest_eightbit characters
filtered out differently - try uncomment "do not print UHHH for now").

> (I saw this bug a year ago when was preparing this test file
> but was too lazy to fix SGML_character() - it should be merged
> against Fote's 2.7.2 which looked more consistent when I saw it last time,
> the same merge was done for HTPlain.c by me early.)



reply via email to

[Prev in Thread] Current Thread [Next in Thread]