[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Using libunistring for string comparisons et al

From: Thien-Thi Nguyen
Subject: Re: Using libunistring for string comparisons et al
Date: Fri, 18 Mar 2011 01:10:40 +0100
User-agent: Gnus/5.13 (Gnus v5.13) Emacs/24.0.50 (gnu/linux)

() Mark H Weaver <address@hidden>
() Thu, 17 Mar 2011 13:58:42 -0400

   * regexp search: The search itself can be implemented bytewise, exactly
     as if it was a fixed-width encoding.  Compiling the regexp can
     _almost_ be implemented as if the UTF-8-encoded regexp was in a
     fixed-width encoding, with just one added complication: a multibyte
     character followed by `*', `?' etc, must be compiled in such a way
     that the suffix operator applies to the whole character, and not just
     its final byte.  (In practice, it's probably more straightforward to
     handling compiling somewhat differently than outlined here, but you
     get the idea).

In unibyte land, "." matches a byte.  OK.

In multibyte land done "bytewise", "." matches ____________.
(What goes in the blank?)

reply via email to

[Prev in Thread] Current Thread [Next in Thread]