Re: [Unac-devel] problem encoding ß
From: Loic Dachary
Subject: Re: [Unac-devel] problem encoding ß
Date: Fri, 6 Sep 2002 16:00:26 +0200
mark warren bracher writes:
> I downloaded the latest unac lib and the Text::Unaccent perl module, and
> it all looks great.
>
> I started throwing as much Spanish/French/German as I remember at it,
> and I've come up with one oddity. The German S-set (or sz ligature,
> those are the only two names I know it by) ß passes straight through any
> attempt to unaccent. It should be encoded as
>
> ß -> ss
>
> in much the same way that the ae ligature
>
> æ -> ae
>
I cannot say which transformation is the correct one: is it
ß -> ß or ß -> ss? Could you find an authoritative answer? I'll
implement whatever is needed to comply with it.
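
To make the two candidates concrete, here is a minimal Python sketch. It is an illustration only, not how unac is implemented: the names strip_marks, unaccent and EXTRA are hypothetical, and the contents of the EXTRA table are an assumption. It shows that an approach based purely on Unicode decomposition leaves ß (and æ) unchanged, because neither character has a decomposition, while an explicit exception table would give ß -> ss. For reference, Unicode full case folding also maps ß to "ss".

```python
import unicodedata

# Hypothetical exception table for characters that have no Unicode
# decomposition; its contents are an assumption, not unac's data.
EXTRA = {"ß": "ss", "æ": "ae", "Æ": "AE"}


def strip_marks(text):
    """Decompose with NFKD, then drop the combining marks."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(c for c in decomposed if not unicodedata.combining(c))


def unaccent(text):
    """Apply the exception table first, then strip combining marks."""
    expanded = "".join(EXTRA.get(c, c) for c in text)
    return strip_marks(expanded)


if __name__ == "__main__":
    for word in ("Straße", "café", "æther"):
        print(word, "->", strip_marks(word), "/", unaccent(word))
```

With decomposition alone, the first candidate (ß -> ß) falls out automatically; the second (ß -> ss) needs a table entry like the one above.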
Cheers,
--
Loic Dachary http://www.dachary.org/ address@hidden
12 bd Magenta http://www.senga.org/ address@hidden
75010 Paris T: 33 1 42 45 07 97 address@hidden
GPG Public Key: http://www.dachary.org/loic/gpg.txt