octave-bug-tracker
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte


From: Rik
Subject: [Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters
Date: Sun, 25 Mar 2012 20:30:49 +0000
User-agent: Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:11.0) Gecko/20100101 Firefox/11.0

Update of bug #35910 (project octave):

                Category:             Interpreter => Libraries              
                  Status:                    None => In Progress            
             Assigned to:                    None => rik5                   

    _______________________________________________________

Follow-up Comment #1:

Confirmed on a recent tip.

Currently Octave does not support UTF8 in regular expressions, although it is
not hard to add.  I've made an experimental patch that does so but it does
impact performance.  UTF8 is 23% slower than non-UTF8 pattern matching.  

Adding support for UTF8 is a big enough change that I am going to take the
issue to the Octave Maintainers mailing list so that we can make a decision
about how best to implement this.  It is also big enough that this will come
out not as a bug fix, but in the next major release of Octave (3.8.0).

    _______________________________________________________

Reply to this item at:

  <http://savannah.gnu.org/bugs/?35910>

_______________________________________________
  Message sent via/by Savannah
  http://savannah.gnu.org/




reply via email to

[Prev in Thread] Current Thread [Next in Thread]