[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Bug-wget] error on non-ascii uri
From: |
Andries E. Brouwer |
Subject: |
Re: [Bug-wget] error on non-ascii uri |
Date: |
Wed, 7 May 2014 19:29:35 +0200 |
User-agent: |
Mutt/1.5.21 (2010-09-15) |
On Wed, May 07, 2014 at 04:02:15PM +0100, grimpen wrote:
> if this is not a bug, what is the fix?
>
> $ cat dlx
> http://imaginäreseigentum.de/de/html/index.html
>
> $ wget -N -i dlx
> Incomplete or invalid multibyte sequence encountered
> --2014-05-07 15:16:02--
> http://imagin%E4reseigentum.de/de/html/index.html
> idn_decode failed (9): `System iconv failed'
> Resolving imagin\344reseigentum.de... failed: Parameter string not
> correctly encoded.
>
> $ echo $LANG
> en_US.UTF-8
You have coded ä as %E4 in the file dlx.
That is latin-1 aka ISO 8859-1.
But you promised that you are using en_US.UTF-8.
In UTF-8 ä is coded as 0xc3 0xa4.
If I try your URL, encoded in UTF-8, all is well.