bug-grep

Re: character ranges in regular expressions


From: Paolo Bonzini
Subject: Re: character ranges in regular expressions
Date: Fri, 24 Sep 2010 07:50:08 +0200

On Thu, Sep 23, 2010 at 23:55, Bruno Haible <address@hidden> wrote:
> In regcomp.c and regexec.c the _NL_COLLATE_COLLSEQWC field of the LC_COLLATE
> part of the locale is encoded in this way. In
> glibc/locale/programs/ld-collate.c
> this field is being constructed from a table called 'collate->wcseqorder'.
> The role of this table is to be used in regular expression matching and
> wildcard matching. The table is derived from (but does not represent the
> entire information from) the LC_COLLATE portion of the locale input file.

Yes, this is what I'm curious about.  Why does the table have the
order A..Za..z for en_US.UTF-8 and aAbB...yYzZ for cs_CZ.UTF-8, even
though strcoll uses the latter in both locales?

Paolo
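
For anyone who wants to check the strcoll half of this question on their own system, here is a minimal sketch. It only shows the order strcoll gives to a handful of letters in each locale; it does not read the _NL_COLLATE_COLLSEQWC table or exercise the regex matcher. The helper name collation_order and the sample letters are made up for illustration, and the result depends on which locales are installed and on the libc version.

```python
import functools
import locale


def collation_order(chars, locale_name):
    """Return chars sorted by strcoll under locale_name (hypothetical helper)."""
    # Remember the current LC_COLLATE setting so it can be restored afterwards.
    saved = locale.setlocale(locale.LC_COLLATE)
    try:
        # Raises locale.Error if the requested locale is not installed.
        locale.setlocale(locale.LC_COLLATE, locale_name)
        return "".join(sorted(chars, key=functools.cmp_to_key(locale.strcoll)))
    finally:
        locale.setlocale(locale.LC_COLLATE, saved)


if __name__ == "__main__":
    letters = "abyzABYZ"
    for name in ("C", "en_US.UTF-8", "cs_CZ.UTF-8"):
        try:
            print(name, collation_order(letters, name))
        except locale.Error:
            print(name, "(locale not installed)")
```

In the C locale the letters come out in code point order (uppercase before lowercase). If the two UTF-8 locales print the same interleaved order, that matches the observation above that strcoll behaves alike in both, which is what makes the differing range tables surprising.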


