converting between charsets
From: Alexander Kotelnikov
Subject: converting between charsets
Date: Sun, 07 May 2006 13:52:08 +0400
User-agent: Gnus/5.1007 (Gnus v5.10.7) Emacs/21.4 (gnu/linux)
Hello.
After I switched to utf-8 as my basic environment encoding (on Linux),
I sometimes need to convert texts back to koi8-r. The typical task is
converting outgoing mail to people and newsgroup hierarchies that do
not understand multibyte encodings.
Theoretically something like
(encode-coding-region (point-min) (point-max) 'koi8-r)
should work, but it does not.
There are three different ways, all of which I checked, in which the
characters to be converted can get into an Emacs buffer:
a. when I open a file that contains them.
b. when I type them in while my X keyboard layout is something other
than 'us'; for me it is normally 'ru'.
c. when I type them in after using toggle-input-method.
The trouble is that encode-coding-region converts only in case (c).
In (a) and (b) the characters that need conversion are replaced with
question marks. And even in (c), although the conversion is performed
(for instance, if I save the file afterwards it turns out to be in
koi8-r), the converted characters are displayed in the buffer in \321
style.
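For reference, the \321 notation is an octal escape: 321 octal is byte 0xD1, which in koi8-r is the letter я. Below is a minimal sketch of a way to check whether Emacs thinks koi8-r can encode the buffer at all, before running encode-coding-region. The name my-koi8-r-check is hypothetical, and this is only a diagnostic, not a fix; it assumes find-coding-systems-region and coding-system-base are available in the Emacs in use.

```elisp
;; Sketch only: `my-koi8-r-check' is a hypothetical helper, not part of Emacs.
(defun my-koi8-r-check ()
  "Report whether koi8-r is able to encode the whole current buffer."
  (interactive)
  ;; `find-coding-systems-region' returns the base coding systems that can
  ;; safely encode the region, or (undecided) when the text is plain ASCII.
  (let ((systems (find-coding-systems-region (point-min) (point-max)))
        ;; koi8-r may be an alias, so compare against its base coding system.
        (koi8 (coding-system-base 'koi8-r)))
    (if (or (equal systems '(undecided))
            (memq koi8 systems))
        (message "koi8-r can encode this buffer")
      (message "koi8-r cannot encode this buffer; candidates: %S" systems))))
```

Running M-x my-koi8-r-check in buffers filled by each of the three methods above should show whether cases (a) and (b) differ from (c) in the coding systems Emacs considers safe.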
So it would be nice to get some help on this, thanks.
Relevant lines in my ~/.emacs are:
(set-language-environment "UTF-8")
(set-terminal-coding-system 'utf-8)
(set-selection-coding-system 'utf-8)
(setq default-buffer-file-coding-system 'utf-8)
(set-input-mode (car (current-input-mode)) (nth 1 (current-input-mode)) 0)
(setq default-input-method "cyrillic-jcuken")
BTW, there are other problems with handling characters outside the
7-bit ASCII range:
1. Pasting in X (from a non-Emacs application into Emacs) does not
work correctly. It seems to be broken in different ways for
single-byte and multibyte text.
2. With my utf-8 setup, non-ASCII input does not work on a terminal
(for example, when Emacs is run in xterm as emacs -nw) if I switch
input by system means (X keyboard layout, console input mode) instead
of toggle-input-method.
Perhaps somebody can comment on this as well.
Thanks once more,
--
Alexander Kotelnikov
Saint-Petersburg, Russia