[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: On language-dependent defaults for character-folding
From: |
Juri Linkov |
Subject: |
Re: On language-dependent defaults for character-folding |
Date: |
Tue, 23 Feb 2016 02:14:55 +0200 |
User-agent: |
Gnus/5.13 (Gnus v5.13) Emacs/25.0.91 (x86_64-pc-linux-gnu) |
> But the most basic issue is that any significant development in these
> directions require to re-implement the feature on the C level, and use
> char-tables for folding, like we do with case-mapping. So until
> someone steps forward for the job, all we can do is small corrections
> to the existing implementation.
Do I understand correctly that essentially what is necessary to do on the
C level is to extend char-tables with character insertions and deletions,
so in addition to canonical equivalence mappings (like are used for the
existing case-mappings) char-tables should also support matching of
multi-character additions (like combining accents in the search
string) and deletions (like combining accents from the search string
missing in the search text)?
> For example, the default state of character-folding might depend on
> the locale's language -- we could turn it off by default for languages
> whose users expressed dissatisfaction with the feature. We could also
> augment the regular expressions created for folding the search string
> by filtering out variants that users of a particular language don't
> want. If people think these ideas will make more users happy, we can
> work on that.
It seems two user variables are necessary for customization:
1. inclusive folding groups that will include by default such pairs
as o - ø, l - ł added to the Unicode decomposition-based rules,
and allow the users to add more rules;
2. exclusive folding groups to exclude locale/language-dependent rules from
the default mappings above, e.g. removing n - ñ for the "es" locale.
- Re: On language-dependent defaults for character-folding, (continued)
- Re: On language-dependent defaults for character-folding, Eli Zaretskii, 2016/02/22
- Re: On language-dependent defaults for character-folding, Richard Stallman, 2016/02/22
- Re: On language-dependent defaults for character-folding, Eli Zaretskii, 2016/02/22
- Re: On language-dependent defaults for character-folding, Richard Stallman, 2016/02/23
- Re: On language-dependent defaults for character-folding, Richard Stallman, 2016/02/23
- RE: On language-dependent defaults for character-folding, Drew Adams, 2016/02/23
- Re: On language-dependent defaults for character-folding, Eli Zaretskii, 2016/02/23
- Re: On language-dependent defaults for character-folding, Richard Stallman, 2016/02/24
- Re: On language-dependent defaults for character-folding, Richard Stallman, 2016/02/22
- Re: On language-dependent defaults for character-folding, Eli Zaretskii, 2016/02/22
- Re: On language-dependent defaults for character-folding,
Juri Linkov <=
- Re: On language-dependent defaults for character-folding, Eli Zaretskii, 2016/02/23
- Re: On language-dependent defaults for character-folding, Juri Linkov, 2016/02/23
- Re: On language-dependent defaults for character-folding, Eli Zaretskii, 2016/02/24
- Re: On language-dependent defaults for character-folding, Juri Linkov, 2016/02/24
- Re: On language-dependent defaults for character-folding, Eli Zaretskii, 2016/02/25
- Re: On language-dependent defaults for character-folding, Juri Linkov, 2016/02/28
- Re: On language-dependent defaults for character-folding, Eli Zaretskii, 2016/02/29
- Re: On language-dependent defaults for character-folding, Juri Linkov, 2016/02/29
- Re: On language-dependent defaults for character-folding, Richard Stallman, 2016/02/26
- Re: On language-dependent defaults for character-folding, Eli Zaretskii, 2016/02/21