Re: Using libunistring for string comparisons et al

guile-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Using libunistring for string comparisons et al

From:	Mark H Weaver
Subject:	Re: Using libunistring for string comparisons et al
Date:	Thu, 17 Mar 2011 11:38:01 -0400
User-agent:	Gnus/5.13 (Gnus v5.13) Emacs/23.3 (gnu/linux)

I have a compromise proposal, which could be implemented for 2.0.x:

We keep wide (UTF-32) stringbufs as-is, but we change narrow stringbufs
to UTF-8, along with a flag that indicates whether it is known to be
ASCII-only.

Applying string-ref or string-set! to a narrow stringbuf would upgrade
it to a wide stringbuf, unless it is known to be ASCII-only.  Better
yet, string-ref should do this only when the index is above a certain
threshold value, and string-set! should do this only for stringbufs
longer than a certain threshold length.

This would keep our accessors O(1), but also ensure that most stringbufs
are narrow.  This is important not only for optimal memory usage, but
also because it means we don't have to worry so much about optimizing
the narrow-wide cases: then we can handle those cases by widening or
narrowing to make them the same width, and then calling libunistring.

In the eventual common case, where string-ref and string-set! are rarely
called, almost all stringbufs would be narrow, so converting to UTF-8
becomes an O(1) operation.

What do you think?

    Mark

[Prev in Thread]

Current Thread

[Next in Thread]

Re: Using libunistring for string comparisons et al, Mike Gran, 2011/03/12
- Re: Using libunistring for string comparisons et al, Mark H Weaver, 2011/03/15
  - Re: Using libunistring for string comparisons et al, Mike Gran, 2011/03/15
    - Re: Using libunistring for string comparisons et al, Mark H Weaver, 2011/03/15
    - Re: Using libunistring for string comparisons et al, Mike Gran, 2011/03/15
    - Re: Using libunistring for string comparisons et al, Mark H Weaver, 2011/03/15
    - Re: Using libunistring for string comparisons et al, Ludovic Courtès, 2011/03/16
    - Re: Using libunistring for string comparisons et al, Mark H Weaver <=
    - Re: Using libunistring for string comparisons et al, Ludovic Courtès, 2011/03/17
    - Re: Using libunistring for string comparisons et al, Mark H Weaver, 2011/03/17
    - Re: Using libunistring for string comparisons et al, Thien-Thi Nguyen, 2011/03/17
    - Re: Using libunistring for string comparisons et al, Mark H Weaver, 2011/03/17
    - Re: Using libunistring for string comparisons et al, Thien-Thi Nguyen, 2011/03/18
    - Re: Using libunistring for string comparisons et al, Mark H Weaver, 2011/03/18
    - Re: Using libunistring for string comparisons et al, Ludovic Courtès, 2011/03/20
    - Re: Using libunistring for string comparisons et al, Andy Wingo, 2011/03/30
    - Re: Using libunistring for string comparisons et al, Ludovic Courtès, 2011/03/17
    - Re: Using libunistring for string comparisons et al, Andy Wingo, 2011/03/19

Prev by Date: enhancement requests
Next by Date: Re: Using libunistring for string comparisons et al
Previous by thread: Re: Using libunistring for string comparisons et al
Next by thread: Re: Using libunistring for string comparisons et al
Index(es):
- Date
- Thread