bug-grep
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[patch #6899] Speed-up for searching in multibyte and ignore-icase.


From: Norihirio Tanaka
Subject: [patch #6899] Speed-up for searching in multibyte and ignore-icase.
Date: Mon, 24 Aug 2009 01:49:39 +0000
User-agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; ja; rv:1.9.1.2) Gecko/20090729 Firefox/3.5.2

URL:
  <http://savannah.gnu.org/patch/?6899>

                 Summary: Speed-up for searching in multibyte and
ignore-icase.
                 Project: grep
            Submitted by: noritnk
            Submitted on: 2009年08月24日 01時49分38秒
                Category: None
                Priority: 5 - Normal
                  Status: None
                 Privacy: Public
             Assigned to: None
        Originator Email: 
             Open/Closed: Open
         Discussion Lock: Any

    _______________________________________________________

Details:

   Kwset is an excellent algorithm that selectively uses Boyer-Moore and
Commentz-Walter algorithm. However, it is necessary to convert the
capital-letter into the small-letter of a text before searching in
ignore-icase.

   Though DFA is also an excellent algorithm, Multi-byte string cannot be
efficiently treated by invalidating it in a lot of Linux distributions.

   These patches will enable to search a pattern in a text efficiently,
without converting the text from the capital-letter into the small-letter,
by using not searching but matching in multibyte and ignore-icase.

   Patched grep will be faster than in the Linux distribution in a lot
of cases.


Fixed bugs :

  - Bug#387704: grep: -i breaks W in some locales (perhaps UTF-8 locale

krown issue:

  - Included regex has no RE_ICASE. It means new regex is needed,
    following issue is reproduced.

      http://lists.gnu.org/archive/html/bug-grep/2009-05/msg00009.html

    Replace regex.h with the one of the system.




    _______________________________________________________

File Attachments:


-------------------------------------------------------
Date: 2009年08月24日 01時49分38秒  Name: grep-2.5.4.dfa-speedup.patch 
Size: 43kB   By: noritnk

<http://savannah.gnu.org/patch/download.php?file_id=18630>
-------------------------------------------------------
Date: 2009年08月24日 01時49分38秒  Name: grep-2.5.4.kwsmb.patch  Size:
19kB   By: noritnk

<http://savannah.gnu.org/patch/download.php?file_id=18629>
-------------------------------------------------------
Date: 2009年08月24日 01時49分38秒  Name: grep-2.5.4.mb-speedup.patch 
Size: 9kB   By: noritnk

<http://savannah.gnu.org/patch/download.php?file_id=18628>
-------------------------------------------------------
Date: 2009年08月24日 01時49分38秒  Name: grep-2.5.4.misc.patch  Size:
2kB   By: noritnk

<http://savannah.gnu.org/patch/download.php?file_id=18631>

    _______________________________________________________

Reply to this item at:

  <http://savannah.gnu.org/patch/?6899>

_______________________________________________
  Message sent via/by Savannah
  http://savannah.gnu.org/





reply via email to

[Prev in Thread] Current Thread [Next in Thread]