[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: diacrit: mark deprecated
From: |
Bruno Haible |
Subject: |
Re: diacrit: mark deprecated |
Date: |
Wed, 17 Jul 2019 23:31:01 +0200 |
User-agent: |
KMail/5.1.3 (Linux/4.4.0-154-generic; KDE/5.18.0; x86_64; ; ) |
Bernhard Voelker wrote:
> In GNU coreutils, we now get this warning during bootstrap:
>
> Notice from module diacrit:
> This module is deprecated. Use the module
> 'uninorm/canonical-decomposition' instead.
>
> And indeed, the 'diacrit' module is still in use by 1 source:
>
> $ GIT_PAGER= git grep -En 'todiac|tobase'
> src/ptx.c:1053: diacritic = todiac (character);
> src/ptx.c:1056: base = tobase (character);
> src/ptx.c:1338: edited_flag[character] = todiac (character) != 0;
Indeed, 'ptx' does not yet support multibyte locales.
$ echo "Böse Bübchen" | ptx -
Böse Bübchen
Böse Bübchen
Böse Bü bchen
Bö se Bübchen
It looks even weirder with Greek input:
$ echo "Το τέλος του Ψυχρού Πολέμου και η διάλυση της Σοβιετικής Ένωσης άφησαν
τις Ηνωμένες Πολιτείες για ένα διάστημα ως τη μόνη υπερδύναμη." | ptx -
(No output at all!)
Bruno