[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[patch #6899] Speed-up for searching in multibyte and ignore-icase.
From: |
Norihirio Tanaka |
Subject: |
[patch #6899] Speed-up for searching in multibyte and ignore-icase. |
Date: |
Mon, 24 Aug 2009 01:49:39 +0000 |
User-agent: |
Mozilla/5.0 (Windows; U; Windows NT 5.1; ja; rv:1.9.1.2) Gecko/20090729 Firefox/3.5.2 |
URL:
<http://savannah.gnu.org/patch/?6899>
Summary: Speed-up for searching in multibyte and
ignore-icase.
Project: grep
Submitted by: noritnk
Submitted on: 2009年08月24日 01時49分38秒
Category: None
Priority: 5 - Normal
Status: None
Privacy: Public
Assigned to: None
Originator Email:
Open/Closed: Open
Discussion Lock: Any
_______________________________________________________
Details:
Kwset is an excellent algorithm that selectively uses Boyer-Moore and
Commentz-Walter algorithm. However, it is necessary to convert the
capital-letter into the small-letter of a text before searching in
ignore-icase.
Though DFA is also an excellent algorithm, Multi-byte string cannot be
efficiently treated by invalidating it in a lot of Linux distributions.
These patches will enable to search a pattern in a text efficiently,
without converting the text from the capital-letter into the small-letter,
by using not searching but matching in multibyte and ignore-icase.
Patched grep will be faster than in the Linux distribution in a lot
of cases.
Fixed bugs :
- Bug#387704: grep: -i breaks W in some locales (perhaps UTF-8 locale
krown issue:
- Included regex has no RE_ICASE. It means new regex is needed,
following issue is reproduced.
http://lists.gnu.org/archive/html/bug-grep/2009-05/msg00009.html
Replace regex.h with the one of the system.
_______________________________________________________
File Attachments:
-------------------------------------------------------
Date: 2009年08月24日 01時49分38秒 Name: grep-2.5.4.dfa-speedup.patch
Size: 43kB By: noritnk
<http://savannah.gnu.org/patch/download.php?file_id=18630>
-------------------------------------------------------
Date: 2009年08月24日 01時49分38秒 Name: grep-2.5.4.kwsmb.patch Size:
19kB By: noritnk
<http://savannah.gnu.org/patch/download.php?file_id=18629>
-------------------------------------------------------
Date: 2009年08月24日 01時49分38秒 Name: grep-2.5.4.mb-speedup.patch
Size: 9kB By: noritnk
<http://savannah.gnu.org/patch/download.php?file_id=18628>
-------------------------------------------------------
Date: 2009年08月24日 01時49分38秒 Name: grep-2.5.4.misc.patch Size:
2kB By: noritnk
<http://savannah.gnu.org/patch/download.php?file_id=18631>
_______________________________________________________
Reply to this item at:
<http://savannah.gnu.org/patch/?6899>
_______________________________________________
Message sent via/by Savannah
http://savannah.gnu.org/
[Prev in Thread] |
Current Thread |
[Next in Thread] |
- [patch #6899] Speed-up for searching in multibyte and ignore-icase.,
Norihirio Tanaka <=