aspell-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [aspell-devel] Problem regarding aspell and hindi dictionary


From: Jose Da Silva
Subject: Re: [aspell-devel] Problem regarding aspell and hindi dictionary
Date: Tue, 16 May 2006 11:26:58 -0700
User-agent: KMail/1.7.2

On May 16, 2006 03:45 am, gm ravi sastry wrote:
> Hi all,
>        I have installed aspell-0.60 .I have also
> installed the english and hindi dictionaries all from
> http://aspell.sourceforge.net/
>
> The s/w works well while checking english documents
> for misspelt words.However when i use it for spell
> checking for hindi documents it dosent work.It looks
> like it is accepting hindi documents with misspelt
> words.Can anyobody suggest  me what i should
> do??Thanks in advance .Looking forwards to a reply
> from your side
> Ravi Sastry

Within the hindi language tar file, you should find a large file that 
ends in letters ****.cwl (compressed word list) or ****.pz (prezipped 
word list).
Some dictionary files contain more than one of those types of files.

You will want to check that file (or files) for wrong words, but before 
you can read it (or them), you need to decompress it (them) into a 
format that you can read/understand.

To decompress the file, go to the command line and use the prezip-bin 
utility like this (please substitute the ???? with the correct name of 
the compressed file):
prezip-bin -d <????.cwl >hindiwordlist.txt
or
prezip-bin -d <????.pz >hindiwordlist.txt

At this point, you have a decompressed text file "hindiwordlist.txt"

Now with a text utility, take a look at the text file and see if you 
find the misspelt words within the wordlist.

If you find the misspelt words in the wordlist, then that may be the 
cause of the wrong spelling because those words might not belong in 
there, and it would be recommended to let the authour of that 
particular dictionary know about the errors you found so that the 
wordlist can be properly updated.

Another possibility is that they are words that really do belong, but 
they are rarely used, in which case they belong in a separate 
dictionary, sort of like keeping a common English dictionary and a 
rare-english dictionary.... or in Aspell terms, the common or rare 
english dictionaries would likely fit in the different "--size" 
options.

Please check the README file within the hindi dictionary tar for the 
authour.
There also appears to be an [aspell-dict-maintainers] list which may be 
more towards solving your problem. Please check here:
http://lists.gnu.org/archive/html/aspell-dict-maintainers/2006-03/index.html

Hopefully that helps.





reply via email to

[Prev in Thread] Current Thread [Next in Thread]