[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: More Cyrillic vs UTF-8

From: Benjamin Riefenstahl
Subject: Re: More Cyrillic vs UTF-8
Date: 29 Apr 2003 18:17:05 +0200
User-agent: Gnus/5.09 (Gnus v5.9.0) Emacs/21.2

Hi Richard,

>     > Unsupported:
>     >
>     > CJK Unified Ideographs Extension A (1.5MB)
>     > CJK Unified Ideographs (5MB)
>     > Yi Syllables
>     > Yi Radicals
>     > Hangul Syllables (7MB)
>     > High Surrogates
>     > Low Surrogates
>     > Old Italic
>     > Gothic
>     > Deseret
>     > Byzantine Musical Symbols
>     > Musical Symbols
>     > Mathematical Alphanumeric Symbols
>     > CJK Unified Ideographs Extension B (13MB)
>     > CJK Compatibility Ideographs Supplement
>     > Tags
>     > Supplementary Private Use Area-A
>     > Supplementary Private Use Area-B
>     It seems that these might be summarized by CJK, Music, Maths,
>     Private Use Area.

Richard Stallman <address@hidden> writes:
> I don't know what "Surrogates" are.  Also, Old Italic and Gothic do
> not fit in that list.  What are "Tags"?

"Surrogates" are the codes that are used in UTF-16 to encode
characters with code points above \uFFFF.

"Tags" are codes used for in-band language tagging.

> Also, I am not sure whether ALL CJK characters are included here.
> For instance, are Hangul letters included here?

Kana, Bopomofo and some CJK compatibility and special symbols are
below \u03FF and/or above \uE000, but the major part of the CJK and
all of Hangul is unsupported.

so long, benny

reply via email to

[Prev in Thread] Current Thread [Next in Thread]