aspell-user
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Aspell-user] aspell-<LANG>: Invalid UTF-8 sequence at position...


From: Kevin Atkinson
Subject: Re: [Aspell-user] aspell-<LANG>: Invalid UTF-8 sequence at position...
Date: Mon, 5 Mar 2007 14:34:29 -0700 (MST)

On Sat, 3 Mar 2007, Kevin Atkinson wrote:

On Sat, 3 Mar 2007, Martin Swift wrote:

On Sat, Mar 03, 2007 at 04:29:15AM -0700, Kevin Atkinson wrote:
The word list is likely in iso-8859-1 but Aspell expects it in utf-8.

Does this mean that aspell expects the word lists to have the same
charset as the machine? Isn't that a little odd?

I don't understand the question.

de.dat sets 'charset' as iso-8859-1:

 # cat de.dat
 # Generated with Aspell Dicts "proc" script version 0.50.1
 name de
 charset iso-8859-1
 soundslike de
 affix      de

Does aspell not use this to determine the charset? If not, /shouldn't/
it?

Yes it should.

I just tried

/usr/bin/prezip-bin -d < de-common.cwl | /usr/bin/aspell --lang=de create --encoding=iso8859-1 master ./de-common.rws

Something is wrong. The "--encoding=iso8859-1" should not be necessary. It should be using the value from "charset" in "de.dat". Try setting your locale to "C" and see if it makes a difference.

Also, please do a "aspell dump config" and send the output as an attachment. It could be that the encoding was being set to UTF-8 in a configuration file.





reply via email to

[Prev in Thread] Current Thread [Next in Thread]