Re: Can watermarking Unicode text using invisible differences sneak thro

emacs-devel

From:	T.V Raman
Subject:	Re: Can watermarking Unicode text using invisible differences sneak through Emacs, or can Emacs detect it?
Date:	Wed, 19 Jan 2022 09:36:56 -0800
User-agent:	Gnus/5.13 (Gnus v5.13) Emacs/29.0.50 (gnu/linux)

Richard Stallman <rms@gnu.org> writes:


This is indeed worrisome and has been around for a while. There is an
even more insidious form of this hack in which Unicode characters that
"appear like English letters" are substituted for them, and a quick
visual scan will miss it. Spammers often use the trick in domain names
within URLs. For example, there are Cyrillic letters that "look like"
Roman letters.
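
A rough sketch of how such look-alike substitution might be flagged
follows. It is only a heuristic, not something Emacs is known to do: it
assumes that the first word of a character's Unicode name (LATIN,
CYRILLIC, GREEK, ...) is a good enough stand-in for its script, and the
function names are made up for illustration. A word spoofed entirely in
one foreign script would not be caught.

```python
import unicodedata


def scripts_in(word):
    """Return the set of script labels used by the letters in `word`.

    The label is the first word of the Unicode character name, which is
    only an approximation of the real Unicode Script property.
    """
    scripts = set()
    for ch in word:
        if ch.isalpha():
            name = unicodedata.name(ch, "")
            if name:
                scripts.add(name.split()[0])
    return scripts


def mixed_script_words(text):
    """Return whitespace-separated words whose letters mix scripts."""
    return [w for w in text.split() if len(scripts_in(w)) > 1]


# The second letter of the domain is CYRILLIC SMALL LETTER A (U+0430).
suspicious = mixed_script_words("visit p\u0430ypal.com today")
```

Here `suspicious` holds the one word that mixes Latin and Cyrillic
letters; plain ASCII words pass through unflagged.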
> [[[ To any NSA and FBI agents reading my email: please consider    ]]]
> [[[ whether defending the US Constitution against all enemies,     ]]]
> [[[ foreign or domestic, requires you to follow Snowden's example. ]]]
>
> There is a thread now about confusables.
>
> I read this,
>
>    Unicode allows user tracking by means of invisible text marking. Any
>    string can be converted into its binary form and then recoded into a
>    string of zero-width characters, which can then be invisibly inserted
>    into the text. If the text is posted elsewhere, the zero-width
>    character string can be extracted and the process reversed to figure
>    out the identity of the person who copied it.
>
> which seems to be about a special case of confusables, and it makes me
> wonder whether Emacs does, or could, show users when Unicode confusion
> occurs, or prevent or fix it somehow.
>
> First, is that issue of invisible characters real?
>
> Second, does Emacs do anything now such that these tricks
> won't succeed?
>
> If the problem exists in Emacs now, could we prevent it?  I see a few
> ways to try.  I don't know whether they would work well.
>
> * Indicate the different encodings on the screen somehow.
>
> * Canonicalize such sequences (perhaps when reading text into Emacs),
> so that different encodings of the same text become identical.
>
> * Use a stand-alone canonicalizer program.
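
To make the quoted description concrete, here is a minimal sketch of
the zero-width marking scheme and of a stand-alone canonicalizer of the
kind suggested above. Everything in it is an assumption for
illustration: the choice of U+200B and U+200C as the two "bit"
characters, the placement of the mark after the first visible
character, and the function names. A real canonicalizer would need to
handle more invisible code points (U+200D, U+2060, U+FEFF and others)
than the two stripped here.

```python
import unicodedata

# Hypothetical bit alphabet: ZERO WIDTH SPACE = 0, ZERO WIDTH NON-JOINER = 1.
ZERO, ONE = "\u200b", "\u200c"


def embed(text, tag):
    """Hide `tag` in `text` as a run of zero-width characters."""
    bits = "".join(f"{byte:08b}" for byte in tag.encode("utf-8"))
    mark = "".join(ONE if bit == "1" else ZERO for bit in bits)
    return text[:1] + mark + text[1:]


def extract(text):
    """Recover a tag hidden by `embed`, ignoring all visible characters."""
    bits = "".join("1" if ch == ONE else "0" for ch in text if ch in (ZERO, ONE))
    usable = len(bits) - len(bits) % 8  # drop any incomplete trailing byte
    data = bytes(int(bits[i:i + 8], 2) for i in range(0, usable, 8))
    return data.decode("utf-8", errors="replace")


def canonicalize(text):
    """Strip the two marker characters, then apply NFC normalization."""
    visible = "".join(ch for ch in text if ch not in (ZERO, ONE))
    return unicodedata.normalize("NFC", visible)


marked = embed("hello", "u42")   # looks like "hello" when displayed
recovered = extract(marked)      # the hidden tag
cleaned = canonicalize(marked)   # the text with the mark removed
```

The NFC step addresses the separate point about different encodings of
the same text: for instance, "e" followed by U+0301 COMBINING ACUTE
ACCENT becomes the single code point U+00E9. Normalization alone does
not remove the zero-width characters, which is why they are stripped
explicitly first.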

-- 

Thanks,

--Raman(I Search, I Find, I Misplace, I Research)
Id: kg:/m/0285kf1
