[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Enhancement request to wc
From: |
Neo Anderson |
Subject: |
Re: Enhancement request to wc |
Date: |
Sun, 8 Feb 2009 06:28:04 -0800 (PST) |
Not very sure whether I can send attach file or not. In this mail I've sent an
attached file (file name is sample.big5), in which it contains 4 Traditional
Chinese words encoded in BIG5. There is no white space between these 4 words.
The content looks like "中文測試".
The env I use is GNU/ Debian Lenny; kernel 2.6.27.8; gcc version 4.3.2 (Debian
4.3.2-1.1); wc (GNU coreutils) 6.10; LANG=en_US.UTF-8 (Other locale settings
e.g. LC_CYTPE are all en_US.UTF-8)
Please let me know if the file fails to attached or needs to upload to
somewhere else.
Thanks for your help,
--- On Sat, 7/2/09, Pádraig Brady <address@hidden> wrote:
> From: Pádraig Brady <address@hidden>
> Subject: Re: Enhancement request to wc
> To: address@hidden
> Cc: address@hidden
> Date: Saturday, 7 February, 2009, 8:00 AM
> Neo Anderson wrote:
> > Hi
> >
> > I read the page at
> http://www.gnu.org/software/coreutils/, saying that the
> enhancement request can go through this mailing list.
> >
> > My request is that I would like wc can also count
> multi bytes characters e.g Chinese Big5 correctly.
> >
> > Please let me know if any additional information
> required.
>
> We're starting work on general multibyte support for
> coreutils,
> but `wc` should already be pretty good in the regards.
>
> Could you provide a small example Big5 encoded file
> and expected output. Also it would help if you provided
> the version of sort, your operating system version
> and what locale you;re using.
>
> thanks,
> Pádraig.
>
>
> _______________________________________________
> Bug-coreutils mailing list
> address@hidden
> http://lists.gnu.org/mailman/listinfo/bug-coreutils
sample.big5
Description: Binary data