[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Lynx-dev] Dumps Unicode file in broken encoding.
From: |
Thorsten Glaser |
Subject: |
Re: [Lynx-dev] Dumps Unicode file in broken encoding. |
Date: |
Mon, 29 Sep 2008 11:15:02 +0000 (UTC) |
Atsuhito Kohda dixit:
>> aqwa『~』$ lynx.cur --dump test.html
>> * é
>> *
This is very interesting:
address@hidden:~ $ lynx -dump test.htm
* é
* �
address@hidden:~ $ lynx -version
Lynx Version 2.8.7dev.9 (27 Apr 2008)
libwww-FM 2.14, SSL-MM 1.4.1, ncurses 5.6.20080830(wide)
Built on linux-gnu Sep 2 2008 02:14:03
────
address@hidden:~ $ lynx -dump test.htm
* é
* à
address@hidden:~ $ lynx -version
Lynx Version 2.8.7dev.8-MirOS (17 Feb 2008)
libwww-FM 2.14, SSL-MM 1.4.1, OpenSSL 0.9.7m, ncurses 5.5.20051010(wide)
No compilation information available.
────
But this is also the key to the solution, because MirBSD always
has the UTF-8 locale (also for assumed charset and display charset),
so, back on Debian, I get this:
address@hidden:~ $ lynx -assume_charset=utf-8 -display_charset=utf-8 -dump
test.htm
* é
* à
This, in the end, means that the stock lynx-cur configuration is broken,
and this is not an upstream problem.
bye,
//mirabilos
--
Sometimes they [people] care too much: pretty printers [and syntax highligh-
ting, d.A.] mechanically produce pretty output that accentuates irrelevant
detail in the program, which is as sensible as putting all the prepositions
in English text in bold font. -- Rob Pike in "Notes on Programming in C"