[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
New branch for charset encoding issues.
From: |
John Darrington |
Subject: |
New branch for charset encoding issues. |
Date: |
Fri, 27 Mar 2009 14:36:39 +0900 |
User-agent: |
Mutt/1.5.13 (2006-08-11) |
I've started a new branch for fixing character set encoding issues.
So far, it reads record 7, subtype 20 to find out the ostensible
encoding of a dataset. It stores this encoding name in the
dictionary. The global "PSPP" encoding is no more.
Things to do before this branch is merged:
* Saving files should write record 7(20).
* More intelligent fallback if 7(20) isn't found.
* Update developers guide.
* Check what happens when mergeing (eg with MATCH, ADD, UPDATE)
datafiles with different encodings.
* Should add some manual override.
Anyway it opens and correctly displays Korean, Japanese and Slovenian
files now.
Comments welcome.
J'
--
PGP Public key ID: 1024D/2DE827B3
fingerprint = 8797 A26D 0854 2EAB 0285 A290 8A67 719C 2DE8 27B3
See http://pgp.mit.edu or any PGP keyserver for public key.
signature.asc
Description: Digital signature
[Prev in Thread] |
Current Thread |
[Next in Thread] |
- New branch for charset encoding issues.,
John Darrington <=