bug-grep

Re: grep for unicode text files?


From: eo rojas
Subject: Re: grep for unicode text files?
Date: Mon, 27 Apr 2009 18:14:13 -0700

Clearly, I don't know.  Next time I have a moment I'll try to figure that
out.
Are LC_ALL or LANG compilation parameters?  If so, I could at least build
myself a grep-16 that does the job.

Is there any interest in making grep recognize the character set?  I might
be willing to do that, if I'm not just the lone wolf.  I think the right way
to do it would be to add a parameter, e.g. --utf-16, and have grep exec
grep-16 to handle it.  This way it would not impinge on performance and would
be backward compatible.  The overhead of checking the first two characters in
the file doesn't seem that high, but the program would need two of each
search routine or something worse.
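
To make the "check the first two characters" idea concrete, here is a rough
sketch in Python rather than in grep's own C.  It is only an illustration, not
how grep works or would be patched: the function names detect_utf16_bom and
grep_lines are made up for this example, and it assumes a UTF-16 file starts
with a byte order mark (files without one are treated as UTF-8).

```python
import codecs
import re


def detect_utf16_bom(path):
    """Return 'utf-16-le', 'utf-16-be', or None, based on the first two bytes."""
    with open(path, "rb") as f:
        head = f.read(2)
    if head == codecs.BOM_UTF16_LE:
        return "utf-16-le"
    if head == codecs.BOM_UTF16_BE:
        return "utf-16-be"
    return None


def grep_lines(path, pattern):
    """Return the lines of `path` that match `pattern` (hypothetical helper).

    Assumption: anything without a UTF-16 BOM is read as UTF-8.
    """
    # The generic 'utf-16' codec consumes the BOM and picks the byte order.
    encoding = "utf-16" if detect_utf16_bom(path) else "utf-8"
    regex = re.compile(pattern)
    with open(path, encoding=encoding, errors="replace") as f:
        return [line.rstrip("\n") for line in f if regex.search(line)]
```

The check costs one two-byte read per file; the decoding happens before the
search, so only one search routine is needed in this sketch, at the price of
transcoding the whole file.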



On Mon, Apr 27, 2009 at 4:46 PM, Norihiro Tanaka <address@hidden> wrote:

> Hi,
>
>   Unlike the vi editor (Vim), grep doesn't automatically recognize the
> character set of a text. You need to set the locale and character set via
> LC_ALL, LANG, etc. Can Cygwin understand UTF-16?
>
>
>
>


-- 
------
emilio

