lynx-dev
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: lynx-dev chartrans to CJK-like display (was: stopping when viewin


From: Leonid Pauzner
Subject: Re: lynx-dev chartrans to CJK-like display (was: stopping when viewing a site)
Date: Thu, 26 Aug 1999 03:14:54 +0400 (MSD)

21-Aug-99 20:43 Klaus Weide wrote:
> On Sat, 21 Aug 1999, Henry Nelson wrote:

>> I have relied on this behavior in the past to create my own default
>> character set, by simply copying it over src/chrtrans/def7_uni.tbl.
>> (An example entry is "0x5c U+00a5" to replace "U+00a5:YEN" because
>> this gives me a true yen sign on a Japanese Windows machine).  It is
>> a kind of trick to get a display character set that matches the
>> machine I am running the terminal emulator on.  It is some hybrid
>> cp???_uni.tbl.  I suppose I could even contact Microsoft to find
>> out what the "official" one is, if I cared enough.

FTP Directory: ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/

 [DIR]   Parent Directory
 [FILE]  CP1250.TXT . . . . . . . . . . .  [Sep 17  1998]     10k
 [FILE]  CP1251.TXT . . . . . . . . . . .  [Sep 17  1998]     10k
 [FILE]  CP1252.TXT . . . . . . . . . . .  [Sep 17  1998]     10k
 [FILE]  CP1253.TXT . . . . . . . . . . .  [Sep 17  1998]      9k
 [FILE]  CP1254.TXT . . . . . . . . . . .  [Sep 17  1998]     10k
 [FILE]  CP1255.TXT . . . . . . . . . . .  [Sep 17  1998]      9k
 [FILE]  CP1256.TXT . . . . . . . . . . .  [Jan  6  1999]      9k
 [FILE]  CP1257.TXT . . . . . . . . . . .  [Sep 17  1998]     10k
 [FILE]  CP1258.TXT . . . . . . . . . . .  [Sep 17  1998]     10k
 [FILE]  CP874.TXT. . . . . . . . . . . .  [Sep 17  1998]      9k
 [FILE]  CP932.TXT. . . . . . . . . . . .  [Sep 17  1998]    297k
 [FILE]  CP936.TXT. . . . . . . . . . . .  [Sep 17  1998]    820k
 [FILE]  CP949.TXT. . . . . . . . . . . .  [Sep 17  1998]    790k
 [FILE]  CP950.TXT. . . . . . . . . . . .  [Sep 17  1998]    511k

I think you noticed that most (256-chars) pages all around 10Kb
while few others hold hundreds kilobytes of data (does that CJK pages?).
The size of the table may be a real problem when thinking on translation
of that type.

> IMO, the more logical way to do this would be the other way around -
> instead of modifying def7_uni (which after all stands for "us-ascii",
> and should be 7-bit only), start with "Japanese (EUC-JP)" and modify
> *that*.  Give this CJK character set a .tbl file, put the additional
> strings in etc. as for other .tbl files, but also add a line 'R 5'
> (see comments in utf8_uni.tbl).  Then treat this like other tables
> included in UCdomap.c, and remove the special definition in UCdomap.h.

> I don't know how lynx would deal with a character set that is CJK *and*
> has translation tables - so far this has just not come up.  Changes
> will be needed in various places, at least so that lynx doesn't skip
> table translation immediately when it sees CJK.  Still I think this
> would be the more logical way to add this kind of limited unicode-to-
> CJK-display translation.




reply via email to

[Prev in Thread] Current Thread [Next in Thread]