major gawk bug
From: Stanislav Ievlev
Subject: major gawk bug
Date: Tue, 8 Jun 2004 15:51:19 +0400
Hello friends!
Why does gawk call setlocale() but still use a hardcoded table
(const char casetable[]) for case-independent regexp matching?
This table is correct only for the latin1 charset and incorrect for others,
e.g. KOI8-R (Russian).
The KOI8-R encoding is fully compatible with 7-bit ASCII (so gawk compiles fine),
but it assigns different characters to the codes of 128 and above.
So gawk supports only latin1 and ignores cp1251, koi8-r, koi8-u, etc.
As far as I understand, it would not be a problem to fill this table with
locale-specific characters at startup.
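
A minimal sketch of what I mean, not a patch against the real gawk source. It
assumes a single-byte locale, that the table may become writable instead of
const, and that each entry holds the lower-case form of its index. The name
init_casetable() is made up for illustration.

```c
#include <ctype.h>
#include <limits.h>
#include <locale.h>

/* Hypothetical writable stand-in for the hardcoded table:
 * one entry per possible byte value. */
static unsigned char casetable[UCHAR_MAX + 1];

/* Fill the table from the case mapping of the current LC_CTYPE locale.
 * Bytes that have no lower-case form map to themselves. */
static void init_casetable(void)
{
    int c;

    for (c = 0; c <= UCHAR_MAX; c++)
        casetable[c] = (unsigned char) tolower(c);
}
```

The program would call setlocale(LC_CTYPE, "") (or LC_ALL) once at startup and
then init_casetable(), so that under a KOI8-R locale the entries of 128 and
above follow the C library's tolower() instead of the latin1 layout. Multibyte
encodings would need a different approach.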
--
With best regards
Stanislav Ievlev
ALT Linux Team.