From: Markus Mützel
Subject: [Octave-bug-tracker] [bug #51210] Non-ASCII text displays incorrectly in plots
Date: Tue, 20 Jun 2017 10:25:22 -0400 (EDT)
User-agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:55.0) Gecko/20100101 Firefox/55.0

Follow-up Comment #5, bug #51210 (project octave):
That is right, Matlab uses 16-bit characters. It seems to be using UCS-2:
>> double(char(129296))
ans =
65535
>> double(native2unicode(129296, 'UTF-32'))
ans =
65533
With native2unicode, all code points outside the BMP seem to be mapped to
65533 (the "replacement character"), whereas char() appears to saturate at
65535.
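For illustration, here is a minimal sketch of why the code point from the
session above cannot be stored in a single 16-bit unit, and what UTF-16 (as
opposed to UCS-2) would store instead. It is written in Python rather than
Matlab/Octave so that it does not depend on either program's char handling;
the helper name utf16_units is hypothetical and only serves this example.

```python
# Code point used in the Matlab session above (U+1F910, outside the BMP).
cp = 129296

def utf16_units(code_point):
    """Return the UTF-16 code units (as integers) for one code point.

    Hypothetical helper for illustration; it does not validate its input.
    """
    if code_point < 0x10000:
        return [code_point]          # fits in a single 16-bit unit (UCS-2 range)
    v = code_point - 0x10000         # 20-bit offset into the supplementary planes
    high = 0xD800 + (v >> 10)        # high (lead) surrogate: top 10 bits
    low = 0xDC00 + (v & 0x3FF)       # low (trail) surrogate: bottom 10 bits
    return [high, low]

units = utf16_units(cp)
print(cp > 0xFFFF)                   # True: does not fit in 16 bits
print([hex(u) for u in units])       # ['0xd83e', '0xdd10']
```

UCS-2 has no surrogate mechanism, so a strictly 16-bit representation has to
substitute or clamp such values, which is consistent with the 65533 and 65535
results shown above.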
Any UTF encoding could be used to map the entire Unicode range. Indexing
characters is generally difficult in Unicode anyway (e.g. because of
combining characters and modifiers). Thus, I reasoned there would be no harm
in using UTF-8, the "default encoding" on Linux...
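As a rough sketch of the indexing point (again in Python, purely for
illustration and not a proposal for how Octave should implement it): the
same code point takes four bytes in UTF-8, and even at the code-point level
one visible character may consist of more than one element. The variable
names are assumptions made for this example.

```python
import unicodedata

# The code point from the session above, encoded as UTF-8.
astral = chr(129296)
astral_utf8 = astral.encode("utf-8")
print(list(astral_utf8))             # [240, 159, 164, 144]: four bytes, one character

# One visible character, two different code point sequences.
composed = "\u00e9"                  # e with acute accent as a single code point
decomposed = "e\u0301"               # plain e followed by a combining acute accent
print(len(composed), len(decomposed))                                  # 1 2
print(len(composed.encode("utf-8")), len(decomposed.encode("utf-8")))  # 2 3

# Normalization (NFC) maps the decomposed form onto the composed one.
print(unicodedata.normalize("NFC", decomposed) == composed)            # True
```

So neither byte indices nor code-point indices correspond reliably to what a
user sees as one character, whichever UTF encoding is chosen.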
Personally, I rarely (never) use characters outside the BMP. Thus, I am not
opposed to limiting ourselves to UCS-2 for compatibility reasons. That would
still be a big improvement over the current situation. However, it might
make things more difficult for people who use less common writing systems
and symbols outside the BMP.
_______________________________________________________
Reply to this item at:
<http://savannah.gnu.org/bugs/?51210>