bug-grep
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH v2] dfa: optimize UTF-8 period


From: Eric Blake
Subject: Re: [PATCH v2] dfa: optimize UTF-8 period
Date: Mon, 19 Apr 2010 16:48:42 -0600
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.9) Gecko/20100330 Fedora/3.0.4-1.fc12 Lightning/1.0b1 Thunderbird/3.0.4

On 04/19/2010 06:14 AM, Paolo Bonzini wrote:
> +  /* A valid UTF-8 character is
> +
> +          ([0x00-0x7f]
> +           |[0xc2-0xdf][0x80-0xbf]
> +           |[0xe0-0xef[0x80-0xbf][0x80-0xbf]
> +           |[0xf0-f7][0x80-0xbf][0x80-0xbf][0x80-0xbf])

Yes, but in POSIX XBD 9.3.4,
http://www.opengroup.org/onlinepubs/9699919799/toc.htm, the ANYCHAR does
not match NUL.  Do you need to adjust this patch to exclude 0x00?

-- 
Eric Blake   address@hidden    +1-801-349-2682
Libvirt virtualization library http://libvirt.org



Attachment: signature.asc
Description: OpenPGP digital signature


reply via email to

[Prev in Thread] Current Thread [Next in Thread]