[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
bug#24975: Matching issues with characters whose encoding ends in some o
From: |
Norihiro Tanaka |
Subject: |
bug#24975: Matching issues with characters whose encoding ends in some other character |
Date: |
Mon, 28 Nov 2016 23:47:57 +0900 |
Jim Meyering <address@hidden> wrote:
> I suspect this won't be the last word in this area, because it feels
> like we should be able to adjust DFA's tables so that people using
> such locales can retain DFA's efficiency without the bug in the
> current implementation.
Hi Jim,
It is a bug in dfa for period expression in non-UTF8 locales. dfa
calculates transition for single byte characters and a multibyte
character separately and merge both results. However, if backs to
an initial state in transition for single byte characters, we should
stop matching single byte characters.
Thanks,
Norihiro
0001-dfa-avoid-match-middle-in-multibyte-character.patch
Description: Text document