Re: [PATCH v2] dfa: optimize UTF-8 period
From: Eric Blake
Subject: Re: [PATCH v2] dfa: optimize UTF-8 period
Date: Mon, 19 Apr 2010 16:48:42 -0600
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.9) Gecko/20100330 Fedora/3.0.4-1.fc12 Lightning/1.0b1 Thunderbird/3.0.4

On 04/19/2010 06:14 AM, Paolo Bonzini wrote:
> + /* A valid UTF-8 character is
> +
> + ([0x00-0x7f]
> + |[0xc2-0xdf][0x80-0xbf]
> + |[0xe0-0xef][0x80-0xbf][0x80-0xbf]
> + |[0xf0-0xf7][0x80-0xbf][0x80-0xbf][0x80-0xbf])
Yes, but according to POSIX XBD 9.3.4,
http://www.opengroup.org/onlinepubs/9699919799/toc.htm, ANYCHAR does
not match NUL. Do you need to adjust this patch to exclude 0x00?
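
To make the question concrete, here is a minimal sketch of the byte ranges
quoted above with 0x00 excluded from the single-byte case. It is not taken
from the patch or from dfa.c: the function name utf8_period_len and its
interface are hypothetical, and it mirrors the quoted ranges as written, so
it is no stricter about UTF-8 validity than they are.

```c
#include <stddef.h>

/* Hypothetical helper, not part of dfa.c.  Return the length (1-4) of
   the byte sequence at S that a period would match under the ranges
   quoted above, or 0 if there is no match.  N is the number of bytes
   available at S.  */
static size_t
utf8_period_len (const unsigned char *s, size_t n)
{
  size_t len, i;

  if (n == 0)
    return 0;

  /* Single byte: [0x01-0x7f].  0x00 is left out because a period
     does not match NUL.  */
  if (s[0] >= 0x01 && s[0] <= 0x7f)
    return 1;

  /* The lead byte determines how many trailing bytes are expected.  */
  if (s[0] >= 0xc2 && s[0] <= 0xdf)
    len = 2;
  else if (s[0] >= 0xe0 && s[0] <= 0xef)
    len = 3;
  else if (s[0] >= 0xf0 && s[0] <= 0xf7)
    len = 4;
  else
    return 0;

  if (n < len)
    return 0;

  /* Every trailing byte must be in [0x80-0xbf].  */
  for (i = 1; i < len; i++)
    if (s[i] < 0x80 || s[i] > 0xbf)
      return 0;

  return len;
}
```

With this sketch, a lone NUL byte yields 0 (no match), while "a" yields 1
and the two-byte sequence 0xc3 0xa9 yields 2.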
--
Eric Blake address@hidden +1-801-349-2682
Libvirt virtualization library http://libvirt.org