emacs-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: UCS-2BE


From: Kenichi Handa
Subject: Re: UCS-2BE
Date: Fri, 01 Sep 2006 20:28:02 +0900
User-agent: SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/22.0.50 (i686-pc-linux-gnu) MULE/5.0 (SAKAKI)

In article <address@hidden>, Andreas Schwab <address@hidden> writes:

> The above quote is talking about "coded in N octets".  If that's not about
> serialisation, what else is it?

To my understanding, it means 8*N bits here, and the wording
"UCS-4 (Universal Character Set coded in 4 octets)" is just
for explaining from where the the literal "UCS-4" comes.

See this description in C.2.

"As a consequence, UCS-4 can now be taken effectively as an
alias for the Unicode encoding form UTF-32, ..."

So, apparently UCS-4 is CEF here.

By the way, Unicode itself is confusing in names.  For
instance, UTF-32 means both "UTF-32 encoding form (CEF)" and
"UTF-32 encoding scheme (CES)".  Unicode 4.1 says:

"For historical reasons, the Unicode encoding schemes are
also referred to as Unicode (or UCS) transformation formats
(UTF). That term is, however, ambiguous between its usage
for encoding forms and encoding schemes."

---
Kenichi Handa
address@hidden




reply via email to

[Prev in Thread] Current Thread [Next in Thread]