bug-grep

Re: Bug#468912: grep: -z is broken under UTF-8 locales


From: Aníbal Monsalve Salazar
Subject: Re: Bug#468912: grep: -z is broken under UTF-8 locales
Date: Sun, 29 Mar 2009 17:12:37 +1100
User-agent: Mutt/1.5.18 (2008-05-17)

forwarded 468912 address@hidden
thanks

On Sun, Jan 11, 2009 at 10:43:51AM -0500, Ruben Molina wrote:
>On Sun, Mar 02, 2008 at 11:46:41AM +0100, Jakub Wilk wrote:
>>Package: grep
>>Version: 2.5.3~dfsg-3
>>Severity: normal
>>
>>$ export LC_ALL=POSIX
>>$ locale
>>LANG=
>>LC_CTYPE="POSIX"
>>LC_NUMERIC="POSIX"
>>LC_TIME="POSIX"
>>LC_COLLATE="POSIX"
>>LC_MONETARY="POSIX"
>>LC_MESSAGES="POSIX"
>>LC_PAPER="POSIX"
>>LC_NAME="POSIX"
>>LC_ADDRESS="POSIX"
>>LC_TELEPHONE="POSIX"
>>LC_MEASUREMENT="POSIX"
>>LC_IDENTIFICATION="POSIX"
>>LC_ALL=POSIX
>>$ printf 'x\0y\0x\0' | grep -z '^x'; echo
>>xx
>>
>>$ export LC_ALL=en_US.utf8
>>$ locale
>>LANG=
>>LC_CTYPE="en_US.utf8"
>>LC_NUMERIC="en_US.utf8"
>>LC_TIME="en_US.utf8"
>>LC_COLLATE="en_US.utf8"
>>LC_MONETARY="en_US.utf8"
>>LC_MESSAGES="en_US.utf8"
>>LC_PAPER="en_US.utf8"
>>LC_NAME="en_US.utf8"
>>LC_ADDRESS="en_US.utf8"
>>LC_TELEPHONE="en_US.utf8"
>>LC_MEASUREMENT="en_US.utf8"
>>LC_IDENTIFICATION="en_US.utf8"
>>LC_ALL=en_US.utf8
>>$ printf 'x\0y\0x\0' | grep -z '^x'; echo
>>x
>>
>>-- System Information:
>>Debian Release: lenny/sid
>>   APT prefers testing
>>   APT policy: (900, 'testing'), (600, 'unstable'), (500, 'experimental')
>>Architecture: i386 (i686)
>>
>>Kernel: Linux 2.6.24-1-686 (SMP w/2 CPU cores)
>>Locale: LANG=C, LC_CTYPE=pl_PL.utf8 (charmap=UTF-8)
>>Shell: /bin/sh linked to /bin/dash
>>
>>Versions of packages grep depends on:
>>ii  libc6                         2.7-6      GNU C Library: Shared libraries
>>
>>grep recommends no packages.
>>
>>-- no debconf information
>>
>>-- 
>>Jakub Wilk
>
>tags  468912  + confirmed
>found 468912 2.5.3~dfsg-6
>thanks
>
>$ locale
>LANG=es_CO.UTF-8
>LC_CTYPE="es_CO.UTF-8"
>LC_NUMERIC="es_CO.UTF-8"
>LC_TIME="es_CO.UTF-8"
>LC_COLLATE="es_CO.UTF-8"
>LC_MONETARY="es_CO.UTF-8"
>LC_MESSAGES="es_CO.UTF-8"
>LC_PAPER="es_CO.UTF-8"
>LC_NAME="es_CO.UTF-8"
>LC_ADDRESS="es_CO.UTF-8"
>LC_TELEPHONE="es_CO.UTF-8"
>LC_MEASUREMENT="es_CO.UTF-8"
>LC_IDENTIFICATION="es_CO.UTF-8"
>LC_ALL=
>
>$ printf 'x\0y\0x\0' | grep -z '^x'; echo
>x
>

I can reproduce this bug with 2.5.4:

grep -V
GNU grep 2.5.4

printf 'x\0y\0x\0' | grep -z '^x'; echo; echo $?
xx
0

LANG=en_AU.UTF-8 printf 'x\0y\0x\0' | LANG=en_AU.UTF-8 grep -z '^x'; echo; echo $?
x
0
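
The NUL separators that -z writes are invisible on a terminal, which is
why the results above show up only as "xx" versus "x". A minimal sketch
for retesting, which turns each NUL into a newline and counts the
matched records. The helper name count_matches is hypothetical, and the
en_US.utf8 locale is assumed to be installed (if it is not, grep will
most likely fall back to C locale behaviour and the comparison tells
you nothing).

```shell
#!/bin/sh
# count_matches LOCALE
# Hypothetical helper: feed three NUL-terminated records (x, y, x) to
# grep -z under the given locale and print how many records matched '^x'.
count_matches() {
    printf 'x\0y\0x\0' |
        LC_ALL="$1" grep -z '^x' |
        tr '\0' '\n' |      # make each NUL-terminated record one line
        wc -l |
        tr -d ' '           # some wc implementations pad with spaces
}

# Two records start with "x", so 2 is the correct count in any locale.
echo "C:          $(count_matches C)"
# Assumption: en_US.utf8 exists on the system under test.
echo "en_US.utf8: $(count_matches en_US.utf8)"
```

Matching counts on both lines would suggest the grep under test is not
affected; a lower count on the UTF-8 line corresponds to the behaviour
reported in this bug.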



