bug-gnu-utils
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Bug in [...]* matching with acute-u


From: Hans-Bernhard Broeker
Subject: Re: Bug in [...]* matching with acute-u
Date: 29 Jan 2001 17:24:33 GMT

Jorge Stolfi <address@hidden> wrote:

> I think I have run into a bug in gawk's handling of REs of the
> form [...]* when the bracketed list includes certain 8-bit characters,
> specifically u-acute (octal \372).

> The problem occurs in GNU Awk 3.0.4, both under 
> Linux 2.2.14-5.0 (intel i686) and SunOS 5.5 (Sun sparc).

Two comments to this:

1) 3.0.4 is an old version. If you report bugs in open-source programs,
you should always do so with respect to the most currently available
version. In the case at hand, that's 3.0.6, from August 2000

2) I'm not sure at all that AWK is, or has ever been claimed to be, an
8-bit clean program. The behaviour you describe quite nicely fits what
I expect would happen if that was the root of the problem. \372
probably is the internal token or state number of some non-terminal
element in the grammar.
-- 
Hans-Bernhard Broeker (address@hidden)
Even if all the snow were burnt, ashes would remain.



reply via email to

[Prev in Thread] Current Thread [Next in Thread]