[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: uniq prints invalid unique lines multiple times
From: |
Paul Eggert |
Subject: |
Re: uniq prints invalid unique lines multiple times |
Date: |
23 Feb 2004 14:59:45 -0800 |
User-agent: |
Gnus/5.09 (Gnus v5.9.0) Emacs/21.3 |
Reuben Thomas <address@hidden> writes:
> What I expect when I run with LANG=en_GB.UTF-8 is either for uniq to
> return an error (because the file is not valid text), or to print
> one single line (if it's being lenient).
Can you please try coreutils-5.2.0? It has some patches in this area.
ftp://ftp.gnu.org/gnu/coreutils/coreutils-5.2.0.tar.gz
If coreutils-5.2.0 is also broken for you, can you please use GDB to
place a breakpoint on the xmemcoll function and see why it is not
printing an error message for you?
It is possible that you're seeing a bug in your C library's
implementation of strcoll. If strcoll is returning a nonzero value
for identical strings, and is not setting errno to indicate an
encoding error exists in the strings, that would explain your
symptoms. That's why I'm asking for the GDB debugging, if you still
observe the bug in 5.2.0.