[bug-gawk] Problem with substr() after match() with non-ASCII characters

bug-gawk

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[bug-gawk] Problem with substr() after match() with non-ASCII characters

From:	Janis Papanagnou
Subject:	[bug-gawk] Problem with substr() after match() with non-ASCII characters
Date:	Sat, 22 Aug 2015 22:33:52 +0200

The issue was observed using GNU awk 4.1.2 and confirmed to show the
same behaviour in GNU awk 4.1.3.

With the attached program 'testprog' applied on the attached data 'testdata'
I do *not* get the expected result of four lines containing "2007" each, but
instead I get:

2007
0703
2007
0071

The problem is caused/triggered by non-ASCII characters in 'testdata'.

Note: I can run 'testprog' it with LC_ALL=C and the output is as expected.

My understanding is, though, that the implicit results from the match()
function, RSTART and RLENGTH, should be consistently usable in substr(),
independent of the locale setting.

Thanks!

Janis

testdata
Description: Binary data

testprog
Description: Binary data

[Prev in Thread]

Current Thread

[Next in Thread]

[bug-gawk] Problem with substr() after match() with non-ASCII characters, Janis Papanagnou <=
- Re: [bug-gawk] Problem with substr() after match() with non-ASCII characters, Stephane Chazelas, 2015/08/24
  - Re: [bug-gawk] Problem with substr() after match() with non-ASCII characters, Aharon Robbins, 2015/08/24
    - Re: [bug-gawk] Problem with substr() after match() with non-ASCII characters, Hermann Peifer, 2015/08/24
    - Re: [bug-gawk] Problem with substr() after match() with non-ASCII characters, Aharon Robbins, 2015/08/31
- Re: [bug-gawk] Problem with substr() after match() with non-ASCII characters, Aharon Robbins, 2015/08/24
- Re: [bug-gawk] Problem with substr() after match() with non-ASCII characters, Aharon Robbins, 2015/08/31

Prev by Date: Re: [bug-gawk] Potential errors in gawk mANUAL 4.1 aPRIL 2015
Next by Date: Re: [bug-gawk] Problem with substr() after match() with non-ASCII characters
Previous by thread: [bug-gawk] Potential errors in gawk mANUAL 4.1 aPRIL 2015
Next by thread: Re: [bug-gawk] Problem with substr() after match() with non-ASCII characters
Index(es):
- Date
- Thread