bug-bison
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Error UTF-8 strings


From: Ken Moffat
Subject: Re: Error UTF-8 strings
Date: Wed, 24 Jun 2020 15:05:38 +0100
User-agent: Mutt/1.14.5 (2020-06-23)

On Wed, Jun 24, 2020 at 10:20:48AM +0200, Hans Åberg wrote:
> 
> 
> The problem is that I haven't changed my environment variable.
> 
> > LC_ALL=UTF-8
>
> > LC_ALL=fr_FR.UTF-8
> 
> I pointed out that out: There is a double bug, locale dependent generation of 
> the parser file, and relying on software that can't handle LC_CTYPE=UTF-8.

On (at least) linux using glibc, LC_CTYPE requires a valid locale.
And UTF-8 on its own is not a valid locale.

A quick search on google suggests that LC_CTYPE will, among other
things, control what is a valid letter, and lowercase|uppercase
conversions.

Taking an easy case, with languages written in latin alphabets, what
is the uppercase of 'i' ?  In Turkey it is İ (with a dot), because
in turkish dotted-i and dotless-i are different letters.

ĸen
-- 
       He died at the console, of hunger and thirst.
       Next day he was buried, face-down, nine-edge first.
                              - the perfect programmer



reply via email to

[Prev in Thread] Current Thread [Next in Thread]