[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte
From: |
Rik |
Subject: |
[Octave-bug-tracker] [bug #35910] Incorrect regex matching of multi-byte UTF-8 characters |
Date: |
Sun, 25 Mar 2012 20:30:49 +0000 |
User-agent: |
Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:11.0) Gecko/20100101 Firefox/11.0 |
Update of bug #35910 (project octave):
Category: Interpreter => Libraries
Status: None => In Progress
Assigned to: None => rik5
_______________________________________________________
Follow-up Comment #1:
Confirmed on a recent tip.
Currently Octave does not support UTF8 in regular expressions, although it is
not hard to add. I've made an experimental patch that does so but it does
impact performance. UTF8 is 23% slower than non-UTF8 pattern matching.
Adding support for UTF8 is a big enough change that I am going to take the
issue to the Octave Maintainers mailing list so that we can make a decision
about how best to implement this. It is also big enough that this will come
out not as a bug fix, but in the next major release of Octave (3.8.0).
_______________________________________________________
Reply to this item at:
<http://savannah.gnu.org/bugs/?35910>
_______________________________________________
Message sent via/by Savannah
http://savannah.gnu.org/