bug#2497: 23.0.91; Fails to read UTF-8 on Win2k

bug-gnu-emacs

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#2497: 23.0.91; Fails to read UTF-8 on Win2k

From:	David Engster
Subject:	bug#2497: 23.0.91; Fails to read UTF-8 on Win2k
Date:	Sat, 28 Feb 2009 11:14:16 +0100
User-agent:	Gnus/5.110011 (No Gnus v0.11) Emacs/23.0.91 (gnu/linux)

Uwe Siart <uwe.siart@tum.de> writes:
> Stefan Monnier <monnier@iro.umontreal.ca> writes:
>
>> The guessing shouldn't give priority to buffer-file-coding-system.
>> Instead we have the set-coding-system-priority instead. And IIUC utf-8
>> should always have a pretty high priority since false positives are
>> fairly rare. So this still looks like a real bug.
>
> Here I would like to note that I never had false positives in the past
> (before 23.0.91) but I do have false positives now. Therefore I'm
> inclined to call it a bug.

I second this - this has worked for years without problems, and suddenly
it fails to detect UTF-8 with a Latin-1 environment.

I once again confirmed that this behaviour can be tracked down to this
change in detect_coding_charset in coding.c (revision 1.413):

--- coding.c    7 Feb 2009 10:49:39 -0000       1.412
+++ coding.c    9 Feb 2009 00:42:37 -0000       1.413
@@ -5101,7 +5101,7 @@
   valids = AREF (attrs, coding_attr_charset_valids);
   name = CODING_ID_NAME (coding->id);
   if (VECTORP (Vlatin_extra_code_table)
-      && strcmp ((char *) SDATA (SYMBOL_NAME (name)), "iso-8859-"))
+      && strcmp ((char *) SDATA (SYMBOL_NAME (name)), "iso-8859-") == 0)
     check_latin_extra = 1;
   if (! NILP (CODING_ATTR_ASCII_COMPAT (attrs)))
     src += head_ascii;

I'm inclined to say that this change is wrong, since strcmp will only
return 0 if two strings are exactly equal. In this case though, the
string "iso-8859-" is compared to "iso-8859-1" (in my case), so it
returns 1 and therefore check_latin_extra is not set.

-David

[Prev in Thread]

Current Thread

[Next in Thread]

bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Uwe Siart, 2009/02/27
- bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Eli Zaretskii, 2009/02/27
  - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Uwe Siart, 2009/02/27
    - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Eli Zaretskii, 2009/02/27
    - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Uwe Siart, 2009/02/27
    - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Stefan Monnier, 2009/02/28
    - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Uwe Siart, 2009/02/28
    - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, David Engster <=
    - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Eli Zaretskii, 2009/02/28
    - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, David Engster, 2009/02/28
    - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Jason Rumney, 2009/02/28
    - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Stefan Monnier, 2009/02/28
    - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Eli Zaretskii, 2009/02/28
    - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Uwe Siart, 2009/02/28
    - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Stefan Monnier, 2009/02/28
- bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Juanma Barranquero, 2009/02/27
  - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Uwe Siart, 2009/02/27
    - bug#2497: 23.0.91; Fails to read UTF-8 on Win2k, Juanma Barranquero, 2009/02/27

Prev by Date: bug#2451: 23.0.90; rmail - semi-colon in rmail-primary-inbox-list and movemail
Next by Date: bug#2473: usability issues on emacs's describe-mode
Previous by thread: bug#2497: 23.0.91; Fails to read UTF-8 on Win2k
Next by thread: bug#2497: 23.0.91; Fails to read UTF-8 on Win2k
Index(es):
- Date
- Thread