[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: UTF-8 decoding error for characters U+10000 and above (hopefully fix
From: |
Kenichi Handa |
Subject: |
Re: UTF-8 decoding error for characters U+10000 and above (hopefully fixed already) |
Date: |
Mon, 13 Feb 2006 10:11:25 +0900 |
User-agent: |
SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/22.0.50 (i686-pc-linux-gnu) MULE/5.0 (SAKAKI) |
In article <address@hidden>, Joe Wells <address@hidden> writes:
> I'm using the Gentoo ebuild app-editors/emacs-22.0.50_pre20050225
> which is based on a CVS snapshot from last year.
> Try evaluating this:
> (let ((unicode-char-hex-string
> (format "%x"
> (encode-char
> (aref (decode-coding-string
> ;; UTF-8 for U+1D161 (MUSICAL SYMBOL SIXTEENTH NOTE):
> "\355\205\241"
> 'utf-8) 0)
> 'ucs))))
> (if (equal "d161" unicode-char-hex-string)
> (error "Oh no! Emacs dropped 17th bit when decoding the
> character!")))
That version of Emacs supports only BMP as written in the
documenation of utf-8 coding system.
u -- utf-8 (alias of mule-utf-8)
UTF-8 encoding for Emacs-supported Unicode characters.
It supports Unicode characters of these ranges:
U+0000..U+33FF, U+E000..U+FFFF.
They correspond to these Emacs character sets:
ascii, latin-iso8859-1, mule-unicode-0100-24ff,
mule-unicode-2500-33ff, mule-unicode-e000-ffff
[...]
---
Kenichi Handa
address@hidden