|
From: | Makar |
Subject: | [bug #28275] Ranges like [a-z] incorrectly match in UTF systems |
Date: | Mon, 14 Dec 2009 20:57:33 +0000 |
User-agent: | Mozilla/5.0 (X11; U; Linux x86_64; ru-RU; rv:1.9.1.5) Gecko/20091129 Sabayon Firefox/3.5.5 |
Follow-up Comment #2, bug #28275 (project grep): No. It matches various non-ASCII symbols. Like ǹûṣőṏŭṋṽẚęčẉįļẹèĕểöâǩǝŏä For example type (on UTF-8 system): dd if=/dev/urandom bs=1024000 count=1 |iconv -c -f ucs-2 -t utf-8 > random-symbols.txt grep -oha '[a-z]' random-symbols.txt > 'random [a-z].txt' and you'll see what I mean. (file #19265, file #19266) _______________________________________________________ Additional Item Attachment: File name: random-symbols.txt Size:169 KB File name: random [a-z].txt Size:0 KB _______________________________________________________ Reply to this item at: <http://savannah.gnu.org/bugs/?28275> _______________________________________________ Message sent via/by Savannah http://savannah.gnu.org/
[Prev in Thread] | Current Thread | [Next in Thread] |