--- Begin Message ---
Subject: |
Speed-up for case-insensitive matching in multibyte locales |
Date: |
Sun, 12 Jan 2014 16:09:20 +0900 |
Package: grep
Tags: patch
Case-insensitive matching is expensive in multi-byte locales because of
conversion of targeted text to lower case.
However, I seem that awk which uses dfa.c as well as grep don't covert
target text to lower case. I seem that if grep don't use kwset, it
doesn't also have to convert.
If this patch is applied, when no parenthesis and/or backslash is
included in keywords (*), use of kwset and convesion of target is
avoided for case-insensitive matching in multi-byte locales, and
process for ignore-case is accomplished in dfaexec and regex.
(*) When parenthesis and/or backslash is included in keywords,
it's converted to case-sensitive matching. (bug#16232)
grep-ignore-icase.txt
Description: Binary data
--- End Message ---
--- Begin Message ---
Subject: |
Re: bug#16421: Speed-up for case-insensitive matching in multibyte locales |
Date: |
Sun, 26 Jan 2014 08:55:01 -0800 |
I've pushed those three commits, with a small change to the second one
(removing the leading ^ and trailing '\$' in a regexp) to make that
test succeed also with -F.
--- End Message ---