--- Begin Message ---
Subject: |
\w is not synonym for [[:alnum:]] in UTF-8 locales |
Date: |
Fri, 24 Oct 2014 10:19:49 -0400 (EDT) |
Hi,
in the man page there is the following sentence:
"The symbol \w is a synonym for [_[:alnum:]] and \W is a synonym for
[^_[:alnum:]]"
Not counting that in man pages for some other languages (e.g. czech) there is
written
that \w is a synonym for [[:alnum:]] and \W is a synonym for [^[:alnum:]], but
none of them seems to be synonym for \w | \W in UTF-8 locales:
$ export LANG=en_US.UTF-8
$ echo 'á' | grep '[[:alnum:]]'
á
$ echo 'á' | grep '[_[:alnum:]]'
á
$ echo 'á' | grep '\w'
$ echo 'á' | grep '[^[:alnum:]]'
$ echo 'á' | grep '[^_[:alnum:]]'
$ echo 'á' | grep '\W'
á
$ grep --version
grep (GNU grep) 2.20
...
--- End Message ---
--- Begin Message ---
Subject: |
Re: bug#18817: \w is not synonym for [[:alnum:]] in UTF-8 locales |
Date: |
Tue, 28 Oct 2014 20:55:07 -0700 |
FYI, I noticed only after pushing that "make check" was
failing a test because that new script was not executable,
so I've just pushed a follow-up patch to fix that.
On Tue, Oct 28, 2014 at 6:07 PM, Jim Meyering <address@hidden> wrote:
> I've adjusted the commit subject and ChangeLog content, and will push
> this today, then I'll make a pre-release snapshot.
--- End Message ---