info-gnus-english
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: regexp filter to match non-english characters


From: Ted Zlatanov
Subject: Re: regexp filter to match non-english characters
Date: Wed, 05 Nov 2008 13:33:46 -0600
User-agent: Gnus/5.110011 (No Gnus v0.11) Emacs/23.0.60 (gnu/linux)

On Wed, 05 Nov 2008 11:26:14 -0600 "Robert D. Crawford" <rdc1x@comcast.net> 
wrote: 

RDC> I am subscribed to an rss feed from del.icio.us that gives me links to
RDC> pages tagged emacs.  There are a lot of titles that come through with
RDC> Asian characters of various kinds.  Since I don't read these languages,
RDC> I would like to filter them out.  

RDC> Is there a regexp class that will match?  Has anyone else done this?
RDC> How might one accomplish this?

One idea is:

(string-match "[^\\000-\\1ff]" "hello")   ;; OK
(string-match "[^\\000-\\1ff]" "здрасти") ;; not OK (Unicode characters)

This will match character values over 0x1FF, which is the limit of
extended ASCII.  Does that work for you?

Ted


reply via email to

[Prev in Thread] Current Thread [Next in Thread]