In other words, parts of the percent-encoded UTF-8 sequences are decoded back to unprintable ASCII characters.
So a better solution might indeed be to change iri->uri to pass the percent-encoded sequences directly to make-uri without attempts at percent-decoding normalization.
Sungjin's modification to the definition of 'unstructured' is in line with the IRI RFC (except of course we will need to add all other character sets besides Hangul).
However, it was already pointed out by Peter and Alex that URIs containing native UTF-8 sequences might results in invalid URLs being sent to systems that do not understand IRIs or UTF-8.
I will modify iri->uri to avoid normalization and see if this would produce ok results.
Ivan
On Tue, Jan 15, 2013 at 12:20 PM, Alex Shinn <address@hidden> wrote: