bug-coreutils
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#31033: [PATCH] Multibyte support for sort, uniq, join, tr, cut, past


From: Eric Fischer
Subject: bug#31033: [PATCH] Multibyte support for sort, uniq, join, tr, cut, paste, expand, unexpand
Date: Mon, 2 Apr 2018 15:18:14 -0700

As previously discussed on the coreutils mailing list, beginning with

  http://lists.gnu.org/archive/html/coreutils/2017-12/msg00074.html

most of the coreutils text processing commands process bytes instead of
characters, regardless of the user's locale, so they do not handle UTF-8
text or options properly.

I propose the changes in

  https://github.com/ericfischer/coreutils/compare/multibyte-squash

to convert sort, uniq, join, tr, cut, paste, expand, and unexpand to
process characters instead of bytes, allowing them to work correctly on
non-ASCII text, as specified by POSIX.

Eric Fischer


reply via email to

[Prev in Thread] Current Thread [Next in Thread]