[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Unicode and Guile

From: Marius Vollmer
Subject: Re: Unicode and Guile
Date: Wed, 12 Nov 2003 01:29:29 +0100
User-agent: Gnus/5.1002 (Gnus v5.10.2) Emacs/21.3 (gnu/linux)

Tom Lord <address@hidden> writes:

>     ~ (grapheme=? g1 g2 [locale]) => <boolean>
>     ~ (grapheme<? g1 g2 [locale])
>     ~ (grapheme>? g1 g2 [locale])
>     [...]
>     ~ (grapheme-ci=? g1 g2 [locale])
>     ~ (grapheme-ci<? g1 g2 [locale])
>     ~ (grapheme-ci>? g1 g2 [locale])
>       The usual orderings.

Is it a good idea to have an ordering among graphemes, or would it be
better to only order texts, i.e., to allow for the context of a
grapheme to determine the order?

>     ~ (make-text-marker text index) => <marker>

What about having _only_ markers and not allow integers as indices?
Also, what about making TEXTs unmutable by default and instead let
TEXT-REPLACE, etc return a new text object?

>   The new TEXT? and GRAPHEME? types present a simple and traditional
>   interface to "conceptual strings" and "conceptual characters".  
>   They make it easy to express simple algorithms simply and without
>   reference to the internal structure of Unicode.


>   There is no essential difference between a grapheme and a text
>   object of length 1, and thus the proposal makes GRAPHEME? a 
>   subtype of TYPE.

Do we need the concept of grapheme at all, then?

> The proposal also makes it possible to pass strings everywhere that
> text can be used.   I think that's the more interesting direction: 
> just use text- and grapheme- procedures from now on except where you
> _really_ want to refer to octets.

Could we make strings/chars go away completely over time?  For vectors
of octets, there is u8vector? from SRFI-4.

GPG: D5D4E405 - 2F9B BCCC 8527 692A 04E3  331E FAF8 226A D5D4 E405

reply via email to

[Prev in Thread] Current Thread [Next in Thread]