aspell-user
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Aspell-user] Spelling of non-ascii characters


From: Mads Ipsen
Subject: Re: [Aspell-user] Spelling of non-ascii characters
Date: Wed, 18 Jun 2008 09:37:07 +0200
User-agent: Thunderbird 2.0.0.14 (X11/20080421)

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Kevin Atkinson wrote:
| On Tue, 17 Jun 2008, Mads Ipsen wrote:
|
|> Each night our manuals are being spell checked by aspell. Certain words,
|> not in the default dictionary, are added to an extra dictionary using
|>
|> ~  cat wordlist.txt | aspell -a --add-extra-dicts=./wordlist.bin
|>
|> If you add a non-ascii based word to wordlist, such as, "Ångström", the
|> word is still treated as an error when aspell is run.
|>
|> If you, however, add the word "Ångström" to the user directory
|> .aspell.en.pws, the word is no longer regarded as a typo by aspell.
|
| What format are is the extra dictionary in?
|

Sorry, I wrote some rubbish; here are the steps we take

1. Create extra (supplemental) dictionary:

~  aspell --lang=en create master ./wordlist.bin < wordlist.txt

where wordlist.txt in an ordinary ascii (text) file. If wordlist.txt
contains the word "Ångström", the above command generates the warning:

~  Warning: The word "�ngström" is invalid. The character '?' (U+85)
may not appear in the middle of a word. Skipping word.


2. Do spell check and use the supplemental dictionary 'wordlist.bin'

~  cat file_to_checked.txt | aspell -a --add-extra-dicts=./wordlist.bin

If file_to_checked.txt contains the word "Ångström", the command
produces the spell error:

~  & Ångström 3 7: Angstrom, Angstroms, Angstrom's

If we add "Ångström" to .aspell.en.pws, the spell error disappears. But
the creation of the extra dictionary still produces the above warning.

Best regards,

Mads


- --
+---------------------------------+-------------------------+
| Mads Ipsen                      |                         |
| Product Support Specialist      | phone:     +45-35320630 |
| Atomistix A/S                   | fax:       +45-35320635 |
| Juliane Maries Vej 30           |                         |
| DK-2100 Copenhagen              | address@hidden       |
+---------------------------------+-------------------------+
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFIWLsiUiXEzEUa8LARAguIAKCVhFCPXbEJnXFb3X38LnkPzy1lGgCgob2V
VsIVtXwy//gGN6tK7+rOTRg=
=3GYh
-----END PGP SIGNATURE-----




reply via email to

[Prev in Thread] Current Thread [Next in Thread]