emacs-pretest-bug
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: UTF-8 decoding error for characters U+10000 and above (hopefully fix


From: Kenichi Handa
Subject: Re: UTF-8 decoding error for characters U+10000 and above (hopefully fixed already)
Date: Mon, 13 Feb 2006 10:11:25 +0900
User-agent: SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/22.0.50 (i686-pc-linux-gnu) MULE/5.0 (SAKAKI)

In article <address@hidden>, Joe Wells <address@hidden> writes:

> I'm using the Gentoo ebuild app-editors/emacs-22.0.50_pre20050225
> which is based on a CVS snapshot from last year.

> Try evaluating this:

>   (let ((unicode-char-hex-string
>          (format "%x"
>                  (encode-char
>                   (aref (decode-coding-string
>                          ;; UTF-8 for U+1D161 (MUSICAL SYMBOL SIXTEENTH NOTE):
>                          "\355\205\241"
>                          'utf-8) 0)
>                   'ucs))))
>     (if (equal "d161" unicode-char-hex-string)
>         (error "Oh no!  Emacs dropped 17th bit when decoding the 
> character!")))

That version of Emacs supports only BMP as written in the
documenation of utf-8 coding system.

u -- utf-8 (alias of mule-utf-8)

UTF-8 encoding for Emacs-supported Unicode characters.
It supports Unicode characters of these ranges:
    U+0000..U+33FF, U+E000..U+FFFF.
They correspond to these Emacs character sets:
    ascii, latin-iso8859-1, mule-unicode-0100-24ff,
    mule-unicode-2500-33ff, mule-unicode-e000-ffff
[...]

---
Kenichi Handa
address@hidden




reply via email to

[Prev in Thread] Current Thread [Next in Thread]