Re: Ispell and unibyte characters

emacs-devel

From:	Eli Zaretskii
Subject:	Re: Ispell and unibyte characters
Date:	Thu, 12 Apr 2012 22:01:30 +0300

> Date: Thu, 12 Apr 2012 16:36:57 +0200
> From: Agustin Martin <address@hidden>
> 
> I am still dealing with an open issue here. Some languages have non 7bit
> wordchars, like Catalan middledot, and it should be converted to UTF-8 if
> default communication language is changed to UTF-8.

Sorry, I don't understand: do you mean "non 8-bit wordchars"?  I don't
think 7 bits is assumed anywhere.

Assuming you did mean 8-bit, then why not use UTF-8 for Catalan from
the get-go?  Only some languages can use single-byte encodings, and
evidently Catalan is not one of them.  For that matter, why shouldn't
aspell and hunspell use UTF-8 by default (something I already asked)?

> I have looked at the encoding stuff and I am currently trying something
> like
> 
> (if ispell-encoding8-command
>     ;; Convert non 7bit otherchars to utf-8 if needed
>     (encode-coding-string
>      (decode-coding-string (nth 3 adict) (nth 7 adict))
>      'utf-8)
>   (nth 3 adict)) ; otherchars
> 
> to get new UTF-8 string where
> 
> (nth 7 adict) -> dict-coding-system
> (nth 3 adict) -> Original otherchars
> 
> but get a sgml-lexical-context error. Need to look more carefully, so this
> will take longer. I am far from expert in handling encodings, so comments
> are welcome.

I don't understand what you are trying to accomplish by encoding
OTHERCHARS in UTF-8.  What exactly is the problem with them being
encoded in some 8-bit encoding?  Please explain.
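
For reference, here is a minimal sketch of what the quoted snippet does to
a single character, assuming the dictionary's coding system is ISO-8859-1
and OTHERCHARS holds only the Catalan middle dot.  The helper name
my-otherchars-to-utf-8 is hypothetical and not part of ispell.el; only
decode-coding-string and encode-coding-string are real Emacs functions.

```elisp
;; Hypothetical helper, not part of ispell.el: re-encode OTHERCHARS
;; from the dictionary's 8-bit coding system to UTF-8.
(defun my-otherchars-to-utf-8 (otherchars coding-system)
  "Return OTHERCHARS, encoded in CODING-SYSTEM, re-encoded as UTF-8 bytes."
  (encode-coding-string
   (decode-coding-string otherchars coding-system)
   'utf-8))

;; "\267" is the single byte 0xB7, the middle dot in ISO-8859-1
;; (assumed here to be the dictionary's coding system).
(string-to-list (my-otherchars-to-utf-8 "\267" 'iso-8859-1))
;; → (194 183), i.e. the two-byte UTF-8 sequence 0xC2 0xB7
```

So the one-byte OTHERCHARS entry becomes a two-byte string, which is
useful only if the spell-checker process is actually being addressed in
UTF-8.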
