[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Case insensitivity seems to ignore lower bound of interval
From: |
Eric Bischoff |
Subject: |
Case insensitivity seems to ignore lower bound of interval |
Date: |
Tue, 26 Apr 2011 17:27:49 +0200 |
User-agent: |
KMail/1.13.6 (Linux/2.6.38-8-generic; KDE/4.6.2; x86_64; ; ) |
Hi all,
$ echo "ijklmnopqrstuvwxyz" | awk '{ gsub(/[R-Z}/, "X"); print }
ijklmnopqrXXXXXXXX
please notice that "r" is not matched, i.e. case insensitivity is applied only
to [S-Z] interval.
$ awk --version
GNU Awk 3.1.7
(...)
$ echo $LANG
fr_FR.UTF-8
The problem does not appear when locale is C.
The problem does not appear when interval is specified as [r-z] (lower case)..
This contradicts http://www.gnu.org/software/gawk/manual/gawk.html#Locales
which documents
$ echo something1234abc | gawk '{ sub("[A-Z]*$", ""); print }'
as returning
something1234
while it returns
something1234a
Bug reproduced both on Ubuntu Natty beta 2 and on Fedora 15.
I hope that helps,
--
Éric Bischoff - Bureau Cornavin
Technical writing and translations
http://www.bureau-cornavin.com
(+33) 3 68 46 00 85
sip:address@hidden
- Case insensitivity seems to ignore lower bound of interval,
Eric Bischoff <=