[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
bug#17027: [PATCH] grep: prefer regex to DFA for ANYCHAR in non-UTF8 loc
From: |
Norihiro Tanaka |
Subject: |
bug#17027: [PATCH] grep: prefer regex to DFA for ANYCHAR in non-UTF8 locales |
Date: |
Tue, 18 Mar 2014 00:01:05 +0900 |
Package: grep
Tags: patch
When ANYCHAR is included in a pattern in non-UTF8 locales, grep prefer
to DFA engine to regex's. However, as long as I tested, even after have
applied Patch#17025, regex engine is slower than DFA's for ANYCHAR in
non-UTF8 locales.
This patch prefers regex to DFA for ANYCHAR in non-UTF8 locales.
Create the text.
$ yes abcd.abc | head -1000000 > m
I tested below before applying it.
$ time -p env LC_ALL=ja_JP.eucJP src/grep abcd.abd m
real 1.99
user 1.75
sys 0.28
I re-tested after applying it.
$ time -p env LC_ALL=ja_JP.eucJP src/grep abcd.abd m
real 1.21
user 0.71
sys 0.46
Norihiro
patch2.txt
Description: Text document
[Prev in Thread] |
Current Thread |
[Next in Thread] |
- bug#17027: [PATCH] grep: prefer regex to DFA for ANYCHAR in non-UTF8 locales,
Norihiro Tanaka <=