info-gnus-english
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: regexp filter to match non-english characters


From: Robert D. Crawford
Subject: Re: regexp filter to match non-english characters
Date: Wed, 05 Nov 2008 14:14:31 -0600
User-agent: Gnus/5.13 (Gnus v5.13) Emacs/23.0.60 (gnu/linux)

Hello Ted,

Ted Zlatanov <tzz@lifelogs.com> writes:

> On Wed, 05 Nov 2008 11:26:14 -0600 "Robert D. Crawford" 
> <rdc1x@comcast.net> wrote:
>
>> I am subscribed to an rss feed from del.icio.us that gives me
>> links to pages tagged emacs.  There are a lot of titles that come
>> through with Asian characters of various kinds.  Since I don't
>> read these languages, I would like to filter them out.
>
>> Is there a regexp class that will match?  Has anyone else done
>> this?  How might one accomplish this?
>
> One idea is:
>
> (string-match "[^\\000-\\1ff]" "hello")   ;; OK
> (string-match "[^\\000-\\1ff]" "здрасти") ;; not OK (Unicode characters)
>
> This will match character values over 0x1FF, which is the limit of
> extended ASCII.  Does that work for you?

Will this match the unicode double ">" and the like?  Some people feel
the need to use these in their breadcrumbs and such.  If there is no way
to just filter out the foreign characters, I will use it.  

The other possibility is to lower permanently on each character that is
read to me, but this seems tedious and time consuming on my part and
likely slow for gnus to score.

Thanks for your help,
rdc
-- 
Robert D. Crawford                                      rdc1x@comcast.net

To add insult to injury.
                -- Phaedrus





reply via email to

[Prev in Thread] Current Thread [Next in Thread]