[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Aspell-user] Spelling of non-ascii characters
From: |
Mads Ipsen |
Subject: |
Re: [Aspell-user] Spelling of non-ascii characters |
Date: |
Wed, 18 Jun 2008 09:37:07 +0200 |
User-agent: |
Thunderbird 2.0.0.14 (X11/20080421) |
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Kevin Atkinson wrote:
| On Tue, 17 Jun 2008, Mads Ipsen wrote:
|
|> Each night our manuals are being spell checked by aspell. Certain words,
|> not in the default dictionary, are added to an extra dictionary using
|>
|> ~ cat wordlist.txt | aspell -a --add-extra-dicts=./wordlist.bin
|>
|> If you add a non-ascii based word to wordlist, such as, "Ångström", the
|> word is still treated as an error when aspell is run.
|>
|> If you, however, add the word "Ångström" to the user directory
|> .aspell.en.pws, the word is no longer regarded as a typo by aspell.
|
| What format are is the extra dictionary in?
|
Sorry, I wrote some rubbish; here are the steps we take
1. Create extra (supplemental) dictionary:
~ aspell --lang=en create master ./wordlist.bin < wordlist.txt
where wordlist.txt in an ordinary ascii (text) file. If wordlist.txt
contains the word "Ångström", the above command generates the warning:
~ Warning: The word "�ngström" is invalid. The character '?' (U+85)
may not appear in the middle of a word. Skipping word.
2. Do spell check and use the supplemental dictionary 'wordlist.bin'
~ cat file_to_checked.txt | aspell -a --add-extra-dicts=./wordlist.bin
If file_to_checked.txt contains the word "Ångström", the command
produces the spell error:
~ & Ångström 3 7: Angstrom, Angstroms, Angstrom's
If we add "Ångström" to .aspell.en.pws, the spell error disappears. But
the creation of the extra dictionary still produces the above warning.
Best regards,
Mads
- --
+---------------------------------+-------------------------+
| Mads Ipsen | |
| Product Support Specialist | phone: +45-35320630 |
| Atomistix A/S | fax: +45-35320635 |
| Juliane Maries Vej 30 | |
| DK-2100 Copenhagen | address@hidden |
+---------------------------------+-------------------------+
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
iD8DBQFIWLsiUiXEzEUa8LARAguIAKCVhFCPXbEJnXFb3X38LnkPzy1lGgCgob2V
VsIVtXwy//gGN6tK7+rOTRg=
=3GYh
-----END PGP SIGNATURE-----