[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Octave-bug-tracker] [bug #57107] regexp functions fail on ISO-8859-1 in

From: Markus Mützel
Subject: [Octave-bug-tracker] [bug #57107] regexp functions fail on ISO-8859-1 input
Date: Sun, 3 Nov 2019 08:31:28 -0500 (EST)
User-agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:71.0) Gecko/20100101 Firefox/71.0

Update of bug #57107 (project octave):

                 Summary: regexp functions fail on ISO-8859 input => regexp
functions fail on ISO-8859-1 input


Follow-up Comment #20:

The attached patch wires in the validation of UTF-8 at a pretty low level. It
applies on top of "bug57107_validate_u8.patch".
It breaks "strread" and "textread" (and possibly also other things). But I'm
still waiting for feedback on the maintainers mailing list to see if it is
worth looking into why.
Nevertheless, it demonstrates what could happen:

octave:1> char (181)
ans = µ
octave:2> double (ans)
ans =

   194   181

octave:3> char ([181 228])
ans = µä
octave:4> double (ans)
ans =

   194   181   195   164

(file #47784)

Additional Item Attachment:

File name: bug57107_char.patch            Size:9 KB


Reply to this item at:


  Message sent via Savannah

reply via email to

[Prev in Thread] Current Thread [Next in Thread]