[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
bug#25455: uniq considers all the full-width punctuation and Japanese ka
From: |
Mingye Wang (Arthur2e5) |
Subject: |
bug#25455: uniq considers all the full-width punctuation and Japanese kana as the same under zh_CN.UTF-8 locale |
Date: |
Tue, 17 Jan 2017 18:22:48 +0000 |
15.01.2017, 20:01, "Icenowy Zheng" <address@hidden>:
> Problem:
> When dealing lines with only a Chinese full-width punctuation or Japanese kana
> and locale is zh_CN.UTF-8, uniq command will consider all the lines are the
> same, and wrongly removed different punctuations.
To narrow the scope down a bit, I should mention that LC_COLLATE is enough to
trigger the bug:
printf '%s\n' 。 , ? ! a b c | LC_COLLATE=zh_CN.UTF-8 uniq
--
Regards,
Arthur2e5