--- Begin Message ---
Subject: 23.0.90; Emacs fails to detect utf-8 encoding with language environment Latin-1
Date: Tue, 17 Feb 2009 11:35:11 +0100
User-agent: Gnus/5.110011 (No Gnus v0.11) Emacs/23.0.90 (gnu/linux)
This is what I believe to be a regression in CVS Emacs since the
23.0.90 pretest. I'm using a fresh CVS checkout from 2009-02-17,
compiled with 'make bootstrap'.
You can reproduce it as follows:
1. emacs -Q
2. M-x set-language-environment RET Latin-1 RET
3. In some buffer write:
(ucs-insert "2500")
4. Eval it, so that the Unicode character is inserted into the buffer.
5. Save the file and choose utf-8 as encoding.
6. Kill the buffer.
7. Load the file you just saved.
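The steps above can be approximated non-interactively with a small
function. This is only a sketch: `my-utf8-roundtrip' is a hypothetical
name, and it writes the character with `coding-system-for-write' bound
to utf-8 instead of answering the coding-system prompt by hand, which I
assume is equivalent for the purpose of this report.

```elisp
;; Hypothetical helper, not part of Emacs.
(defun my-utf8-roundtrip ()
  "Write U+2500 as UTF-8, re-read the file, and report what was detected.
Return a list (CODING-SYSTEM TEXT) for the re-read file."
  (set-language-environment "Latin-1")            ; step 2
  (let ((file (make-temp-file "utf8-detect-")))
    (unwind-protect
        (progn
          ;; Steps 3-5: save the character with an explicit UTF-8 encoding.
          (let ((coding-system-for-write 'utf-8))
            (with-temp-file file
              (insert (string #x2500))))
          ;; Step 7: read the file back and let Emacs detect the encoding.
          (with-temp-buffer
            (insert-file-contents file)
            (list last-coding-system-used (buffer-string))))
      ;; Always remove the temporary file.
      (delete-file file))))
```

Evaluating (my-utf8-roundtrip) in "emacs -Q" should return a utf-8
coding system and a one-character string when detection works. A
Latin-1 coding system with a three-character string would indicate the
problem described here.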
Result: Emacs displays "â\224\200" for the Unicode character.
Expected behaviour: Emacs should detect utf-8 encoding and display the
correct character.
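The displayed text is consistent with the three UTF-8 bytes of U+2500
being read as Latin-1. A minimal sketch of that check follows;
`my-misread-as-latin-1' is a hypothetical helper name, and the coding
system names are the standard utf-8 and iso-latin-1.

```elisp
;; Hypothetical helper, not part of Emacs.
(defun my-misread-as-latin-1 (char)
  "Encode CHAR as UTF-8, then decode those bytes as Latin-1.
Return a list (BYTES CHARS), where each element is a list of integers."
  (let* ((bytes (encode-coding-string (string char) 'utf-8))
         (misread (decode-coding-string bytes 'iso-latin-1)))
    (list (append bytes nil)       ; raw byte values of the UTF-8 form
          (append misread nil))))  ; characters seen when read as Latin-1

;; U+2500 is E2 94 80 in UTF-8, i.e. octal \342 \224 \200.
(my-misread-as-latin-1 #x2500)
```

The byte list is (226 148 128). Read as Latin-1, 226 is "â", and 148
and 128 fall in the control range that Emacs shows as the octal escapes
\224 and \200.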
Please note that this worked without problems in the Emacs 23.0.90
pretest, so it must be due to some change(s) in CVS since then.
In GNU Emacs 23.0.90.1 (i686-pc-linux-gnu, GTK+ Version 2.12.11)
of 2009-02-17 on void
Windowing system distributor `The X.Org Foundation', version 11.0.10402000
configured using `configure '--prefix=/usr/local/emacs''
Important settings:
value of $LC_ALL: nil
value of $LC_COLLATE: nil
value of $LC_CTYPE: nil
value of $LC_MESSAGES: nil
value of $LC_MONETARY: nil
value of $LC_NUMERIC: nil
value of $LC_TIME: nil
value of $LANG: nil
value of $XMODIFIERS: nil
locale-coding-system: nil
default-enable-multibyte-characters: t
Major mode: Lisp Interaction
Minor modes in effect:
tooltip-mode: t
tool-bar-mode: t
mouse-wheel-mode: t
menu-bar-mode: t
file-name-shadow-mode: t
global-font-lock-mode: t
font-lock-mode: t
blink-cursor-mode: t
global-auto-composition-mode: t
auto-composition-mode: t
auto-encryption-mode: t
auto-compression-mode: t
line-number-mode: t
transient-mark-mode: t
Recent input:
M-x r e p o <tab> r <tab> C-g M-x s e t - l a n <tab>
<return> L a t i n w <backspace> - w <return> <backspace>
1 <return> M-x r e p o <tab> r <tab> <return>
Recent messages:
For information about GNU Emacs and the GNU system, type C-h C-a.
Making completion list...
Quit
Making completion list...
--- End Message ---
--- Begin Message ---
Subject: Re: bug#2497: 23.0.91; Fails to read UTF-8 on Win2k
Date: Sat, 28 Feb 2009 14:21:08 +0200
> From: David Engster <deng@randomsample.de>
> Date: Fri, 27 Feb 2009 18:46:12 +0100
> Cc: emacs-pretest-bug@gnu.org, 2497@emacsbugs.donarmstrong.com
>
> Uwe Siart <uwe.siart@tum.de> writes:
> > I'm using the windows port of 23.0.91 on Win2k SP4 and I found that it
> > fails to read utf-8 encoded files correctly. When visiting a file in
> > utf-8 encoding all characters above 255 are screwed up and "C-h C RET"
> > indicates iso-latin1-dos for saving the file. This has not been an
> > issue in 23.0.90.
>
> Maybe this is a duplicate of what I reported in
>
> http://emacsbugs.donarmstrong.com/cgi-bin/bugreport.cgi?bug=2354
>
> As I write later in that bug report, I think I could track down this
> issue to the change in revision 1.413 of src/coding.c. Maybe you could
> try if the same applies to your problem.
Should be fixed by this change:
2009-02-28 Eli Zaretskii <eliz@gnu.org>
* coding.c (detect_coding_charset): Fix change from 2008-10-21.
Also, check iso-latin-*, not only iso-8859-*.
--- End Message ---