[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Lynx-dev] Zero-width space
From: |
Thomas Dickey |
Subject: |
Re: [Lynx-dev] Zero-width space |
Date: |
Sat, 24 Aug 2024 12:37:12 -0400 |
On Sat, Aug 24, 2024 at 08:39:04AM -0600, rbell--- via Lynx-dev wrote:
> lynx displays ​ verbatim for me, which is the point of
> my message.
> I edit lynx to make lynx undisplay it. I first tried adding
> it to the treatment of other undisplaying characters in LYCharUtils.c,
> which didn't work, had to change UCdomap.c to make lynx undisplay it.
>
> I apologize for my invalid example. I don't run UTF,
> translate UTF docs. I've done it for so long I forgot, thought the
> ​ in that 'The New Yorker' article was native.
I don't see a literal ​ in lynx with ISO-8859-1 or 7bit approximations.
Your locale settings (and the effective options) might be relevant.
I see that "&#" is generated in SGML.c, but that would be used in the
source view, rather than the browsing view. Checking that, with
+ running lynx with locale C
+ turning locale-based charset off
+ setting document charset to UTF-8
+ setting display charset to ISO-8859-1
and looking at the source view, I see a few numeric entities, but no 8203.
For example, 8209:
+condenast.com\u002Fuser-agreement\u002F\"\u003EUser Agreement\u003C\u002Fa\u00▒
+3E and \u003Ca href=\"http:\u002F\u002Fwww.condenast.com\u002Fprivacy-policy\u▒
+002F#privacypolicy\"\u003EPrivacy Policy\u003C\u002Fa\u003E\u003C\u002Fp\u003E▒
+","singleDangerousDek":"If, alternatively, you would like to adjust your ▒
+e‑mail preferences, please \u003Ca href=\"\u002Faccount\u002Fsign-i▒
+n\"\u003Esign in\u003C\u002Fa\u003E."},"useManageForSubscribe":true},"newslett▒
+erList":[{"category":"The New Yorker Newsletter","frequencyBadge":"Daily","ind
(the \u's look odd, but appear to be in the original file)
--
Thomas E. Dickey <dickey@invisible-island.net>
https://invisible-island.net
signature.asc
Description: PGP signature