bug-gawk

Re: [bug-gawk] GAWK for Windows does not work properly with UTF-8


From: Eli Zaretskii
Subject: Re: [bug-gawk] GAWK for Windows does not work properly with UTF-8
Date: Wed, 17 Feb 2016 22:07:14 +0200

> Date: Wed, 17 Feb 2016 20:40:56 +0100
> From: Marc de Bourget <address@hidden>
> 
> Hello Eli, do you have a tip on how to use multibyte characters in
> character classes correctly, also combined with negation, as in
> [^èé][a-z]?  I have always thought that pattern matching is no problem
> with UTF-8, but character classes seem to be a problem.  Are there
> workarounds?  Can you please help me? THANK YOU VERY MUCH!

I don't see how this could work on Windows as long as the files must
be encoded in UTF-8.  One workaround is to recode the files in
something like codepage 1252 and, if your system codepage is
different, to use the chcp command to switch to that codepage before
running Gawk.  I would then expect the above matching to work.
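
The recoding step can be done with any tool that converts between
encodings.  As a rough sketch only, here is one way to do it in Python;
the function name and the file paths are hypothetical, and the target
is assumed to be codepage 1252 as in the example above:

```python
def recode_utf8_to_cp1252(src_path, dst_path):
    """Rewrite a UTF-8 text file as codepage 1252 (hypothetical helper)."""
    with open(src_path, "rb") as src:
        # "utf-8-sig" also accepts input that starts with a UTF-8 BOM.
        text = src.read().decode("utf-8-sig")
    # Encode in memory first: this raises UnicodeEncodeError if some
    # character has no cp1252 equivalent, before any output is written.
    data = text.encode("cp1252")
    with open(dst_path, "wb") as dst:
        dst.write(data)
```

After recoding, each of è and é is a single byte in the file, so a
bracket expression such as [^èé] refers to individual characters once
the console is switched to the same codepage with chcp.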

If you are lucky, and all of the characters you need to match can be
encoded in your system codepage, then that's what I would suggest
doing.
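
Whether you are "lucky" in this sense can be checked up front.  The
following Python sketch lists the characters in a text that cannot be
encoded in a given codepage; the function name is hypothetical, and
cp1252 is only an assumed default, so substitute your own system
codepage:

```python
def unencodable_chars(text, codepage="cp1252"):
    """Return the sorted distinct characters of text that codepage cannot encode."""
    missing = set()
    for ch in text:
        try:
            ch.encode(codepage)
        except UnicodeEncodeError:
            missing.add(ch)
    return sorted(missing)
```

An empty result means every character can be represented in that
codepage, so recoding the file loses nothing.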
