aspell-user

Re: [Aspell-user] aspell-<LANG>: Invalid UTF-8 sequence at position...


From: Kevin Atkinson
Subject: Re: [Aspell-user] aspell-<LANG>: Invalid UTF-8 sequence at position...
Date: Sat, 3 Mar 2007 04:29:15 -0700 (MST)

On Sat, 3 Mar 2007, Martin Swift wrote:

On Sat, Mar 03, 2007 at 02:56:49AM -0700, Kevin Atkinson wrote:
Can you please give me some more context?  Where/when do you get these
errors?

Sorry.

While compiling (Gentoo's emerge).

  app-dicts/aspell-de-0.60_pre20030222  268 kB
  app-dicts/aspell-is-0.51.1.0  317 kB

Output:

 >>> Emerging (1 of 1) app-dicts/aspell-de-0.60_pre20030222 to /
  * aspell6-de-20030222-1.tar.bz2 MD5 ;-) ...        [ ok ]
  * aspell6-de-20030222-1.tar.bz2 RMD160 ;-) ...     [ ok ]
  * aspell6-de-20030222-1.tar.bz2 SHA1 ;-) ...       [ ok ]
  * aspell6-de-20030222-1.tar.bz2 SHA256 ;-) ...     [ ok ]
  * aspell6-de-20030222-1.tar.bz2 size ;-) ...       [ ok ]
  * checking ebuild checksums ;-) ...                [ ok ]
  * checking auxfile checksums ;-) ...               [ ok ]
  * checking miscfile checksums ;-) ...              [ ok ]
  * checking aspell6-de-20030222-1.tar.bz2 ;-) ...   [ ok ]
 >>> Unpacking source...
 >>> Unpacking aspell6-de-20030222-1.tar.bz2 to /var/tmp/portage/app-dicts/aspell-de-0.60_pre20030222/work
 >>> Source unpacked.
 >>> Compiling source in /var/tmp/portage/app-dicts/aspell-de-0.60_pre20030222/work/aspell6-de-20030222-1 ...
 Finding Dictionary file location ... /usr/lib/aspell-0.60
 Finding Data file location ... /usr/lib/aspell-0.60
 /usr/bin/prezip-bin -d < de-common.cwl | /usr/bin/aspell  --lang=de create master ./de-common.rws
 /usr/bin/prezip-bin -d < de_AT-only.cwl | /usr/bin/aspell  --lang=de create master ./de_AT-only.rws
 Warning: The string "Abh<F6>rma<DF>nahme/N" is invalid. Invalid UTF-8 sequence at position 4. Skipping string.

The word list is likely in ISO-8859-1, but Aspell expects it in UTF-8. Your locale settings _should_ not have an effect here. What does have an effect is the setting in the language data file "de.dat", in particular "data-encoding". See
  http://aspell.net/man-html/The-Language-Data-File.html
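
To illustrate the mismatch, here is a minimal Python sketch (not part of Aspell). It takes the bytes of the word from the warning as they would appear in an ISO-8859-1 word list and tries to read them as UTF-8. The helper name first_invalid_utf8_offset is made up for this example. Python reports a 0-based byte offset, so offset 3 would correspond to Aspell's "position 4" only if Aspell counts from 1, which is an assumption.

```python
# Bytes of "Abh<F6>rma<DF>nahme/N" as stored in an ISO-8859-1 word list:
# 0xF6 is o-umlaut and 0xDF is sharp s in that encoding.
raw = b"Abh\xf6rma\xdfnahme/N"


def first_invalid_utf8_offset(data: bytes):
    """Return the 0-based byte offset of the first invalid UTF-8
    sequence in data, or None if data is valid UTF-8.
    (Hypothetical helper, for illustration only.)"""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        return err.start
    return None


# 0xF6 can never start a UTF-8 sequence, so decoding fails right after "Abh".
offset = first_invalid_utf8_offset(raw)

# Re-encoding the same word as UTF-8 yields bytes that decode cleanly.
recoded = raw.decode("iso-8859-1").encode("utf-8")

print("first invalid offset in original bytes:", offset)
print("first invalid offset after recoding:", first_invalid_utf8_offset(recoded))
```

This only shows why the bytes are rejected when read as UTF-8; whether the right fix is to change "data-encoding" or to recode the word list depends on how the dictionary package is set up.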




