[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: cannot encode a buffer??

From: Stephen J. Turnbull
Subject: Re: cannot encode a buffer??
Date: Wed, 25 Jun 2003 20:22:22 +0900
User-agent: Gnus/5.1001 (Gnus v5.10.1) XEmacs/21.4 (Portable Code, linux)

>>>>> "Kenichi" == Kenichi Handa <address@hidden> writes:

    Kenichi> If you force saving them by utf-8, the encoder generates
    Kenichi> a utf-8 byte sequence corresponding to U+FFFD for each of
    Kenichi> them.

Isn't that a violation of the Unicode standard?

I agree that the preferences of those who would rather that Emacs keep
the different flavors of Han different should be respected.  FWIW, I'd
default `utf-translate-cjk-mode' to on (to encourage development of a
Unicode-based way to disambiguate Unihan), but that does risk a lot of
annoyance for Asian polyglots.

However, if somebody insists on saving as UTF-8, the result should be
unification of Japanese to Chinese (which after all can be read, if
you can read both languages), not destruction of text.  Ie, it should
not be possible for Emacs to convert any JIS X 0208 character to
U+FFFD, ever.

Institute of Policy and Planning Sciences     http://turnbull.sk.tsukuba.ac.jp
University of Tsukuba                    Tennodai 1-1-1 Tsukuba 305-8573 JAPAN
               Ask not how you can "do" free software business;
              ask what your business can "do for" free software.

reply via email to

[Prev in Thread] Current Thread [Next in Thread]