help-gnu-emacs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: garbage chars when pasting French chars into emacs


From: ken
Subject: Re: garbage chars when pasting French chars into emacs
Date: Thu, 02 Feb 2012 15:00:37 -0500
User-agent: Thunderbird 2.0.0.24 (X11/20111109)

On 02/01/2012 10:55 PM Eli Zaretskii wrote:
Date: Wed, 01 Feb 2012 21:39:22 -0500
From: ken <gebser@mousecar.com>

What is your value of selection-coding-system?  Try setting it to
something like ctext-with-extensions.
Thanks, Eli,

Immediately prior to doing the copy-and-paste I ran all of these:

(set-language-environment               'UTF-8)
(set-default-coding-systems             'utf-8)
(setq file-name-coding-system           'utf-8)
(setq default-buffer-file-coding-system 'utf-8)
(setq coding-system-for-write           'utf-8)
(set-keyboard-coding-system             'utf-8)
(set-terminal-coding-system             'utf-8)
(set-clipboard-coding-system            'utf-8)
(set-selection-coding-system            'utf-8)
(prefer-coding-system                   'utf-8)
(modify-coding-system-alist 'process "\\*shell\\*\\'" 'utf-8-unix)

Not a good idea, I'm afraid: the UTF-8 support in Emacs 21 left a lot
to be desired.

Following your advice, I ran

(set-selection-coding-system 'ctext-with-extensions)

and then did the same copy-and-paste again. This got more of the characters correct, but not all of them.

Looking again, I see my eyes must have been malfunctioning for the text pasted into emacs is rendered correctly. However, when I try to save the text, I'm presented with the minibuffer message "Select coding system (default iso-2022-jp-2): ". At the same time a second buffer opens under the first giving the options for coding system:

===================================================================
These default coding systems were tried:
  mule-utf-8-unix iso-latin-1
However, none of them safely encodes the target text.

Select one of the following safe coding systems:
  iso-2022-jp-2 x-ctext iso-2022-7bit raw-text emacs-mule no-conversion
  ctext-no-compositions iso-2022-8bit-ss2 iso-2022-7bit-lock
  iso-2022-7bit-ss2 tibetan-iso-8bit-with-esc thai-tis620-with-esc
  lao-with-esc korean-iso-8bit-with-esc hebrew-iso-8bit-with-esc
  greek-iso-8bit-with-esc iso-latin-9-with-esc iso-latin-8-with-esc
  iso-latin-5-with-esc iso-latin-4-with-esc iso-latin-3-with-esc
  iso-latin-2-with-esc iso-latin-1-with-esc
  in-is13194-devanagari-with-esc cyrillic-iso-8bit-with-esc
  chinese-iso-8bit-with-esc japanese-iso-8bit-with-esc
===================================================================

Entering "utf-8" into the minibuffer, of course, doesn't work. Frankly, I'd like to save into utf-8, this to avoid having problems with this same file at a later time when I add more text to it.

Can you show the original text, and then what you have after pasting?
I need to see which characters aren't pasted correctly.

The original text must have been edited out somewhere in the thread. It's at <http://www.wikilivres.info/wiki/Maurice_Merleau-Ponty>, from "Francais" through the end of the unordered list, ie, the line ending with "et l’invisible, 1964".

Here it is:

==========================================================
Français

    * La Structure du comportement, 1942
    * La Phénoménologie de la perception, 1945
    * Humanisme et terreur, 1947
    * Sens et non-sens, 1948
    * Les Sciences de l’homme et la phénoménologie
    * Les Relations avec autrui chez l’enfant
* Éloge de la philosophie, leçon inaugurale faite au collège de France, le jeudi 15 janvier 1953
    * Signes, 1960
    * L’œil et l’esprit, Gallimard, 1960
    * Le visible et l’invisible, 1964
==========================================================

Thanks again for your help.



reply via email to

[Prev in Thread] Current Thread [Next in Thread]