[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Pattern replacement fails if string contains multibyte characters
From: |
Andreas Schwab |
Subject: |
Re: Pattern replacement fails if string contains multibyte characters |
Date: |
Fri, 28 Sep 2007 23:19:00 +0200 |
User-agent: |
Gnus/5.110006 (No Gnus v0.6) Emacs/22.1 (gnu/linux) |
Bernd Eggink <monoped@sudrala.de> writes:
> This happens on a utf-8 based system (CRUX 2.3), LANG=de_DE.UTF-8:
>
> t="123abc456äöüABCD"
> echo ${t//[a-c]/}
> # output: 123456öüCD
Which is correct. [a-c] matches every character between a and c
(inclusive) in the collating sequence defined by the locale. For your
locale that includes characters like ä and A. You should avoid the use
of ranges when not using the C locale.
Andreas.
--
Andreas Schwab, SuSE Labs, schwab@suse.de
SuSE Linux Products GmbH, Maxfeldstraße 5, 90409 Nürnberg, Germany
PGP key fingerprint = 58CA 54C7 6D53 942B 1756 01D3 44D5 214B 8276 4ED5
"And now for something completely different."