[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
bug#13041: 24.2; diacritic-fold-search
From: |
Drew Adams |
Subject: |
bug#13041: 24.2; diacritic-fold-search |
Date: |
Sat, 1 Dec 2012 08:38:45 -0800 |
> I don't understand why this thread is talking only about Latin
> characters with diacritics. That is a special case of what Unicode
> calls "compatibility equivalence" (q.e.). For example, even in the
> Latin environments, don't you want to find "sni?" when searching for
> "sniff", and vice versa? And there are similar issues in many
> non-Latin scripts.
Actually, in the original thread I made the same point.
Please see that discussion for this and other points.
http://lists.gnu.org/archive/html/help-gnu-emacs/2012-11/msg00429.html
> The decomposition of a character such as '?' is given by
> the Unicode database... Emacs already supports these
> decomposition properties.
That's good news (new to me). So it sounds like even the most hopeful
wanna-haves of the discussion could perhaps be realized without too much
trouble.
> Using these properties, every search string can be converted to a
> sequence of non-decomposable characters (this process is recursive,
> because the 'decomposition' property can use characters that
> themselves are decomposable). If the user wants to ignore diacritics,
> then the diacritics should be dropped from the decomposition sequence
> before starting the search. E.g., for the decomposition of รจ above,
> we will drop the 768 and will be left with 101, which is 'e'. Then
> searching for that string should apply the same decomposition
> transformation to the text being searched, when comparing them.
>
> This would be the most general way of solving this issue, a way that
> is not limited to diacritics nor to Latin scripts. And doing that
> will move Emacs closer to the goal of being Unicode compatible, since
> support for this is required by the Unicode Standard.
This sounds great. I really hope someone with the time and knowledge adds such
a feature soon (even though, to be clear, I personally do not have much need for
it). I think it would be very handy for many users - most welcome.
- bug#13041: 24.2; diacritic-fold-search, Drew Adams, 2012/12/01
- bug#13041: 24.2; diacritic-fold-search, Eli Zaretskii, 2012/12/01
- bug#13041: 24.2; diacritic-fold-search, Eli Zaretskii, 2012/12/01
- bug#13041: 24.2; diacritic-fold-search,
Drew Adams <=
- bug#13041: 24.2; diacritic-fold-search, Juri Linkov, 2012/12/01
- bug#13041: 24.2; diacritic-fold-search, martin rudalics, 2012/12/02
- bug#13041: 24.2; diacritic-fold-search, Eli Zaretskii, 2012/12/02
- bug#13041: 24.2; diacritic-fold-search, martin rudalics, 2012/12/03
- bug#13041: 24.2; diacritic-fold-search, Eli Zaretskii, 2012/12/03
- bug#13041: 24.2; diacritic-fold-search, martin rudalics, 2012/12/03
- bug#13041: 24.2; diacritic-fold-search, Eli Zaretskii, 2012/12/03
- bug#13041: 24.2; diacritic-fold-search, martin rudalics, 2012/12/04
- bug#13041: 24.2; diacritic-fold-search, Eli Zaretskii, 2012/12/04
- bug#13041: 24.2; diacritic-fold-search, martin rudalics, 2012/12/05