Re: regexp filter to match non-english characters
From: Ted Zlatanov
Subject: Re: regexp filter to match non-english characters
Date: Wed, 05 Nov 2008 13:33:46 -0600
User-agent: Gnus/5.110011 (No Gnus v0.11) Emacs/23.0.60 (gnu/linux)
On Wed, 05 Nov 2008 11:26:14 -0600 "Robert D. Crawford" <rdc1x@comcast.net> wrote:
RDC> I am subscribed to an rss feed from del.icio.us that gives me links to
RDC> pages tagged emacs. There are a lot of titles that come through with
RDC> Asian characters of various kinds. Since I don't read these languages,
RDC> I would like to filter them out.
RDC> Is there a regexp class that will match? Has anyone else done this?
RDC> How might one accomplish this?
One idea is:
(string-match "[^\000-\u01ff]" "hello")   ;; nil: OK, no match
(string-match "[^\000-\u01ff]" "здрасти") ;; 0: not OK (characters above U+01FF)
This matches any character whose code is above 0x1FF, which is past
extended ASCII (Latin-1 ends at 0xFF) and Latin Extended-A. Does that
work for you?
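If it helps, here is a rough sketch of how that test could be wrapped in a predicate and used to drop titles from a list. The names `my-title-foreign-p' and `my-filter-titles' are made up for illustration and are not part of Emacs or Gnus, and the U+01FF cutoff is only the assumption from above; how you hook this into your feed reader is up to you.

```elisp
;; Sketch only: `my-title-foreign-p' and `my-filter-titles' are
;; hypothetical names, not part of Emacs or Gnus.

(defun my-title-foreign-p (title)
  "Return non-nil if TITLE contains a character above U+01FF."
  ;; Bind `case-fold-search' to nil so case folding cannot move a
  ;; character across the U+01FF boundary before it is compared.
  (let ((case-fold-search nil))
    (string-match-p "[^\000-\u01ff]" title)))

(defun my-filter-titles (titles)
  "Return the elements of TITLES with no character above U+01FF."
  (let (kept)
    (dolist (title titles)
      (unless (my-title-foreign-p title)
        (push title kept)))
    (nreverse kept)))

(my-filter-titles '("Emacs tips" "Emacs 入門" "здрасти" "café"))
;; => ("Emacs tips" "café")
```

Note that "café" is kept because é (U+00E9) is below the cutoff. If you only want plain ASCII titles, the built-in class "[[:nonascii:]]" could be used in place of the range.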
Ted