emacs-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: codepage.el: Codepage Mapping, Latin-7


From: Eli Zaretskii
Subject: Re: codepage.el: Codepage Mapping, Latin-7
Date: Fri, 17 Mar 2006 12:48:29 +0200

> From: Reiner Steib <address@hidden>
> Cc: Eli Zaretskii <address@hidden>
> Date: Thu, 16 Mar 2006 15:35:57 +0100
> 
> from lisp/international/codepage.el:
> 
> --8<---------------cut here---------------start------------->8---
> ;; Support for the Windows 12xx series of codepages that MS has
> ;; butchered from the ISO-8859 specs. This does not add support for
> ;; the extended characters that MS has added in the 128 - 159 coding
> ;; range, only translates those characters that can be expressed in
> ;; the corresponding iso-8859 charset.
> 
> ;; Codepage Mapping:
> ;;
> ;; Windows-1250: ISO-8859-2 (Central Europe) - differs in some positions
> ;; Windows-1251: ISO-8859-5 (Cyrillic)       - differs wildly
> ;; Windows-1252: ISO-8859-1 (West Europe)    - exact match
> ;; Windows-1253: ISO-8859-7 (Greek)          - differs in some positions
> ;; Windows-1254: ISO-8859-9 (Turkish)        - exact match
> ;; Windows-1255: ISO-8859-8 (Hebrew)         - exact match
> ;; Windows-1256: ISO-8859-6 (Arabic)         - half match
> ;; Windows-1257: ISO-8859-4 (Baltic)         - differs, future Latin-7
> ;; Windows-1258: VISCII (Vietnamese)         - Completely different
> --8<---------------cut here---------------end--------------->8---
> 
> I think "exact match" is a little bit misleading, because
> e.g. Windows-1252 is a superset of ISO-8859-1.  When reading the
> comment on the "128 - 159 coding range" carefully it becomes more
> clear the mapping table only refers to ISO-8859 code points.

Yes, the match is "exact" only in one direction: from ISO-8859
encoding to the codepage, but not the other way around.

It is meaningless to say that Windows-1252 is a ``superset'' of
ISO-8859-1, since these are encodings, not character sets.  The
underlying Emacs character set does not include characters which can
be encoded by Windows-1252, but codepage.el is only intersted in
encoding the ISO-8859-1 charset, not anything beyond that.

> BTW: The comment in `codepage.el' says that Latin-7 isn't official and
> the it's not supported by Emacs.  This doesn't seem to be correct
> anymore.

codepage.el is obsolete and is not developed anymore.  It is only used
by the MS-DOS port of Emacs; everything else uses code-pages.el.
codepage.el will die a natural death when the MS-DOS port dies,
probably when v23.1 is released.




reply via email to

[Prev in Thread] Current Thread [Next in Thread]