[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: bug in join: case comparisons don't work in multibyte locales
From: |
Pádraig Brady |
Subject: |
Re: bug in join: case comparisons don't work in multibyte locales |
Date: |
Wed, 11 Mar 2009 02:55:09 +0000 |
User-agent: |
Thunderbird 2.0.0.6 (X11/20071008) |
Pádraig Brady wrote:
> Bruno Haible wrote:
>> Hi Jim,
>
> Thanks for looking at this Bruno.
>
>> In coreutils/src/join.c, there is a FIXME mentioning that the -i option for
>> case insensitive comparison of the input lines does not work in multibyte
>> locales.
>
> Utils that have this issue are:
> join -i, uniq -i, sort -f, ptx -f
Note as well as folding case I think it might
be useful to fold other forms like:
Diacritics: À -> A
Enclosed: \u24b6 -> A
Stylistic: \uff21-> A
I.E. have more general function like:
ulc_coll(fold={Case|Diactritics|Stylistic}, ...);
I'm not suggesting it's required or anything,
just something to keep in mind while designing
these interfaces.
cheers,
Pádraig.