nmh-workers

Re: [Nmh-workers] nmh 1.6: character set checks and exmh compatibility


From: Ken Hornstein
Subject: Re: [Nmh-workers] nmh 1.6: character set checks and exmh compatibility
Date: Mon, 17 Oct 2016 17:33:23 -0400

>BTW, WRT spotting multi-byte UTF-8 encoding, I don't think that's a
>goer.  Valid UTF-8 and valid GB2312 can share the same sequences,
>especially if it's just the odd `£' or `拢' in ASCII text.

It was just a suggestion, and not one I was particularly crazy about ... but
not all arbitrary 8-bit sequences are valid UTF-8.  For GB2312 (using the
EUC-CN encoding, right?) detection looks harder, but there are certainly
invalid sequences for GB2312 too.  Still, I do not think this is a business
we should be in: pick your locale properly or explicitly specify a
character set in the draft.
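
To make the point concrete, here is a minimal Python sketch (not nmh code).
The helper name decodes_as is made up for illustration, and it assumes
Python's built-in "gb2312" codec as a stand-in for EUC-CN.  It shows that
the UTF-8 bytes for `£' are valid in both encodings, while some other
sequences are rejected by one or both:

```python
def decodes_as(data: bytes, charset: str) -> bool:
    """Return True if data is a valid byte sequence in charset.

    Hypothetical helper for illustration; relies on Python's strict decoders.
    """
    try:
        data.decode(charset)
    except UnicodeDecodeError:
        return False
    return True


# The UTF-8 encoding of the pound sign is the two bytes C2 A3.
pound_utf8 = "£".encode("utf-8")

# The same two bytes are also a valid GB2312 character, so validity
# alone cannot tell the two character sets apart here.
both_valid = decodes_as(pound_utf8, "utf-8") and decodes_as(pound_utf8, "gb2312")
as_gb2312 = pound_utf8.decode("gb2312")

# A lone A3 byte (the Latin-1 pound sign) is not valid UTF-8, and it is
# an incomplete two-byte sequence in GB2312.
lone_a3 = b"\xa3"
lone_a3_utf8_ok = decodes_as(lone_a3, "utf-8")
lone_a3_gb2312_ok = decodes_as(lone_a3, "gb2312")

# A three-byte UTF-8 sequence such as the euro sign (E2 82 AC) is valid
# UTF-8 but not valid GB2312, because 82 cannot be a trailing byte.
euro_utf8 = "€".encode("utf-8")
euro_gb2312_ok = decodes_as(euro_utf8, "gb2312")

if __name__ == "__main__":
    print(both_valid, as_gb2312)
    print(lone_a3_utf8_ok, lone_a3_gb2312_ok)
    print(euro_gb2312_ok)
```

So a validity check can rule a character set out, but it cannot confirm
one when the bytes happen to be legal in both.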

--Ken


