Re: master e39cb515a10 1/4: Correctly handle non-BMP characters in Andro

emacs-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: master e39cb515a10 1/4: Correctly handle non-BMP characters in Andro

From:	Po Lu
Subject:	Re: master e39cb515a10 1/4: Correctly handle non-BMP characters in Android content file names
Date:	Sat, 23 Mar 2024 20:11:26 +0800
User-agent:	Gnus/5.13 (Gnus v5.13)

Eli Zaretskii <eliz@gnu.org> writes:

> AFAIU, this is because Java uses UTF-16 encoded strings to support
> Unicode, is that right?  If so, why not use encode-coding and
> decode-coding to en/decode between UTF-16 and the internal
> representation?  AFAIR, we want to deprecate CCL, and thus using it in
> new code should be avoided.

I think you've misunderstood that code.  Java communicates with C using
a custom character encoding that, while resembling UTF-8, encodes
characters that UTF-8 represents with 4-byte sequences as 3-byte
sequences of surrogate pairs, and the NULL character as a special
two-byte sequence, and it is this unique (i.e. underivable) coding
system that is being defined here.  It wouldn't be wise to remove CCL
until some better means of defining custom coding systems comes into
existence, and when it does I'll be as glad as you to see it replace
CCL, but until then there's not really an alternative short of
implementing it in coding.c.

[Prev in Thread]

Current Thread

[Next in Thread]

Re: master e39cb515a10 1/4: Correctly handle non-BMP characters in Android content file names, Eli Zaretskii, 2024/03/23
- Re: master e39cb515a10 1/4: Correctly handle non-BMP characters in Android content file names, Po Lu <=

Prev by Date: Re: [PATCH] Implement fast verisons of json-parse functions
Next by Date: Re: recursion to iteration macro
Previous by thread: Re: master e39cb515a10 1/4: Correctly handle non-BMP characters in Android content file names
Next by thread: csv-mode: short-cut to set header line?
Index(es):
- Date
- Thread