bug-gnu-emacs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#37009: EWW Gets Confused on Invalid HTML


From: Eli Zaretskii
Subject: bug#37009: EWW Gets Confused on Invalid HTML
Date: Tue, 13 Aug 2019 21:13:46 +0300

> From: Noam Postavsky <npostavs@gmail.com>
> Date: Tue, 13 Aug 2019 13:55:01 -0400
> Cc: 37009@debbugs.gnu.org
> 
> > Unfortunately, the page does not escape the less-than symbol before "xs"
> > on the second line, so the "<-" (and several more characters) aren't
> > displayed.
> 
> I'm not sure how feasible it will be to fix this at all.  Eww relies on
> libxml for parsing, and it's not as flexible as a typical web browser:
> 
>     (with-temp-buffer
>       (insert "<html>
>       <body>abc <- xyz<body>
>     </html>")
>       (libxml-parse-html-region (point-min) (point-max)))
> 
>     ;=> (html nil (body nil "abc\n"))

Maybe we should report this to libxml developers and hear their
opinion?





reply via email to

[Prev in Thread] Current Thread [Next in Thread]