octave-bug-tracker
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Octave-bug-tracker] [bug #51210] Non-ASCII text displays incorrectly in


From: Markus Mützel
Subject: [Octave-bug-tracker] [bug #51210] Non-ASCII text displays incorrectly in plots
Date: Tue, 20 Jun 2017 10:25:22 -0400 (EDT)
User-agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:55.0) Gecko/20100101 Firefox/55.0

Follow-up Comment #5, bug #51210 (project octave):

That is right, Matlab uses 16-bit characters. It seems to be using UCS-2:

>> double(char(129296))
ans =
       65535
>> double(native2unicode(129296, 'UTF-32'))
ans =
       65533


All code points outside the BMP seem to be mapped to 65533 ("replacement
character").

Any UTF encoding could be used to map the entire Unicode range. Indexing
characters generally is difficult in Unicode anyway (e.g. given the character
modifiers). Thus, I reasoned there would be no harm in using Linux' "default
encoding" UTF-8...

Personally, I rarely (never) use characters outside the BMP. Thus, I am not
opposing to limiting ourselves to UCS-2 for compatibility reasons. That would
be a big improvement to the current situation nevertheless. However, that
might make it more difficult for people using less common writing systems and
symbols outside the BMP.

    _______________________________________________________

Reply to this item at:

  <http://savannah.gnu.org/bugs/?51210>

_______________________________________________
  Message sent via/by Savannah
  http://savannah.gnu.org/




reply via email to

[Prev in Thread] Current Thread [Next in Thread]