[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Emacs-diffs] master db828f6: Don't rely on defaults in decoding UTF

From: Paul Eggert
Subject: Re: [Emacs-diffs] master db828f6: Don't rely on defaults in decoding UTF-8 encoded Lisp files
Date: Sun, 27 Sep 2015 13:21:51 -0700
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.2.0

Eli Zaretskii wrote:
This is unrelated: it specifies which character sequences should be
composed and displayed as a single grapheme cluster.

Yes. It might be reasonable to replace some of those \u instances for readability, e.g.:

-          ("V" . "[\u0904-\u0914\u0960-\u0961\u0972]") ; independent vowel
+          ("V" . "[ऄ-औॠ-ॡॲ]") ; independent vowel

But replacements would not be such a good idea for some of this code, e.g.:

-          ("H" . "\u094D")         ; HALANT
+          ("H" . "्")              ; HALANT

as standalone combining characters are problematic on display, and here:

-          ("J" . "\u200D")         ; ZWJ
+          ("J" . "‍")              ; ZWJ

where one can't easily see a zero width joiner when editing the source file. I expect that whoever wrote that code felt more comfortable sticking with \u escapes uniformly, rather than using \u sometimes and not other times.

reply via email to

[Prev in Thread] Current Thread [Next in Thread]