aspell-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[aspell-devel] Building Aspell on Red Hat Linux 9


From: Lars Aronsson
Subject: [aspell-devel] Building Aspell on Red Hat Linux 9
Date: Mon, 2 Jun 2003 20:44:25 +0200 (CEST)

A friend tried to build Aspell's Swedish dictionary (aspell-sv-1.3.8)
on Red Hat Linux 9.  Apparently, this Linux distribution sets
LANG=en_US.UTF-8 by default and the dictionary is written in
ISO8859-1.  This is what happened:

aspell-sv-1.3.8> make all
./unsq < words-sv.sq | aspell --local-data-dir=./ --lang=svenska
create master ./svenska
Malformed UTF-8 character (unexpected end of string) at ./unsq line
51, <stdin> line 7.
Malformed UTF-8 character (unexpected end of string) at ./unsq line
51, <stdin> line 105.
Malformed UTF-8 character (unexpected end of string) at ./unsq line
51, <stdin> line 106.

which of course is quite confusing to a novice.

The matter here is not en_US or sv_SE, but the fact that some
ISO8859-1 characters are interpreted as prefixes for UTF-8 two byte
sequences.  I think the Makefile should explicitly set the LANG and/or
LC_CTYPE environment variables before running "unsq" and "aspell".
(Unsq or un-squeeze is a Perl script distributed with aspell-sv-1.3.8)

Is there an official Aspell policy to move to UTF-8?  Does the
Swedish/Danish dictionary team at SSLUG any decision on this?
It appears that Red Hat Linux has taken this move already in version 8,
http://www.redhat.com/docs/manuals/linux/RHL-8.0-Manual/release-notes/x86/


-- 
  Lars Aronsson (address@hidden)
  Aronsson Datateknik - http://aronsson.se/





reply via email to

[Prev in Thread] Current Thread [Next in Thread]