Re: ucs-normalize and diacritics

From: Eli Zaretskii
Subject: Re: ucs-normalize and diacritics
Date: Fri, 03 Aug 2018 20:57:21 +0300

> From: address@hidden (K. Handa)
> Cc: address@hidden, address@hidden
> Date: Fri, 03 Aug 2018 23:21:54 +0900
>
> In article <address@hidden>, Eli Zaretskii <address@hidden> writes:
>
> > I meanwhile removed tis620-2533 on master and made it an alias to
> > thai-iso8859-11.  It looked to me that a difference of one character
> > doesn't justify having another charset with peculiar traits which in
> > addition causes unintuitive behavior.  Do you think that change is OK?
>
> I think it is not the right way.  Currently,
>   (encode-char #xa0 'tis620-2533) => nil
>   (encode-char #xa0 'thai-iso8859-11) => 32
> But, your change results in:
>   (encode-char #xa0 'tis620-2533) => 32
> which is simply wrong.

How important is that discrepancy?  After all, tis620-2533 doesn't
have the #xa0 character, so the situation above should never happen,
right?  OTOH, it is strange to have two charsets that are identical
except for one character.

> That unintuitive behavior you mentioned is just Emacs' fault, and can be
> fixed, for instance, by my previous patch.

Yes, it could be fixed, but why do we need to jump through hoops to
keep one more charset, when we already have a charset that is
identical to it except for one character?
