[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Lynx-dev] Lynx mangles some UTF-8 chars on POST (KMM184778069V18779
From: |
Thorsten Glaser |
Subject: |
Re: [Lynx-dev] Lynx mangles some UTF-8 chars on POST (KMM184778069V18779L0KM) |
Date: |
Sat, 3 Mar 2012 16:07:26 +0000 (UTC) |
address@hidden dixit:
>The UTF-8 character string in question was: E2 80 99
>On the wire (line 139 of the Lynx.trace) this gets sent as:
>%C3%A2%80%99, an invalid UTF-8 sequence. The correct string would be:
>%E2%80%99
This is definitively ouch. I was not able to reproduce this
with dev.11 and the following HTML:
<html><body><form action="http://localhost/tmp/se.cgi" method="post">
<textarea name="foo"></textarea>
<input type="submit"/>
</form></body></html>
Whereas se.cgi is (simplified):
#!/apps/mksh
print Content-type: text/plain
print Entropy: $RANDOM
print
set
print --
------------------------------------------------------------------------------
if [[ $REQUEST_METHOD = POST ]]; then
/apps/cat
print
print --
------------------------------------------------------------------------------
fi
exit 0
Can you please retry with dev.12 or, otherwise, try to
find out the difference between Yahoo!’s form and my
reduced testcase? (Things like encoding come to mind;
also try with both HTML5_CHARSET on and off and note
that that switch is broken in your version.)
bye,
//mirabilos
--
FWIW, I'm quite impressed with mksh interactively. I thought it was much
*much* more bare bones. But it turns out it beats the living hell out of
ksh93 in that respect. I'd even consider it for my daily use if I hadn't
wasted half my life on my zsh setup. :-) -- Frank Terbeck in #!/bin/mksh