bug-gnu-emacs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#37009: EWW Gets Confused on Invalid HTML


From: Noam Postavsky
Subject: bug#37009: EWW Gets Confused on Invalid HTML
Date: Tue, 13 Aug 2019 13:55:01 -0400
User-agent: Gnus/5.13 (Gnus v5.13) Emacs/26.1.92 (windows-nt)

Nick Daly <address@hidden> writes:

> EWW gets confused when facing unescaped less-than-signs.

> Unfortunately, the page does not escape the less-than symbol before "xs"
> on the second line, so the "<-" (and several more characters) aren't
> displayed.

I'm not sure how feasible it will be to fix this at all.  Eww relies on
libxml for parsing, and it's not as flexible as a typical web browser:

    (with-temp-buffer
      (insert "<html>
      <body>abc <- xyz<body>
    </html>")
      (libxml-parse-html-region (point-min) (point-max)))

    ;=> (html nil (body nil "abc\n"))








reply via email to

[Prev in Thread] Current Thread [Next in Thread]