Re: Different names for Unicode codepoint
From: tomas
Subject: Re: Different names for Unicode codepoint
Date: Thu, 21 Apr 2016 21:40:20 +0200
User-agent: Mutt/1.5.21 (2010-09-15)
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
On Thu, Apr 21, 2016 at 09:04:32PM +0200, Lele Gaifax wrote:
> Hi,
>
> is there a particular reason for the slightly different names that Emacs
> (version 25.0.92) and Python (version 3.6.0a0) give to a single Unicode
> entity?
>
> Just to mention one codepoint, ⋖ is called "LESS THAN WITH DOT" according to
> Emacs' C-x 8 RET TAB menu, while in Python:
>
> >>> import unicodedata
> >>> unicodedata.name('⋖')
> 'LESS-THAN WITH DOT'
> >>> print("\N{LESS THAN WITH DOT}")
> File "<stdin>", line 1
> SyntaxError: (unicode error) ...: unknown Unicode character name
FWIW, "my" Emacs [1] says:
    Character code properties: customize what to show
      name: LESS-THAN WITH DOT
      old-name: LESS THAN WITH DOT
So the spelling without the hyphen seems to be an older name for the same
codepoint, and the hyphenated one is the current name.
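
For what it's worth, here is a small sketch of how one could check this from the
Python side, assuming Python 3 and its bundled unicodedata module. The helper
name `resolve` is made up for illustration; it just wraps unicodedata.lookup(),
which raises KeyError for a name it doesn't know.

```python
import unicodedata

CH = "\u22d6"  # the character ⋖ from the original question


def resolve(name):
    """Return the character for a Unicode name, or None if Python
    does not know that name. (Hypothetical helper, not a library API.)"""
    try:
        return unicodedata.lookup(name)
    except KeyError:
        return None


# The current name, as reported by Python's Unicode database.
current_name = unicodedata.name(CH)

# Try both spellings mentioned in this thread.
for candidate in ("LESS-THAN WITH DOT", "LESS THAN WITH DOT"):
    found = resolve(candidate)
    status = "known" if found == CH else "unknown"
    print(f"{candidate!r}: {status}")
```

If that matches what you saw, only the hyphenated spelling will be usable in
"\N{...}" escapes too, since those go through the same name table.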
[1] GNU Emacs 25.1.50.1 (x86_64-unknown-linux-gnu, GTK+ Version 2.24.29)
of 2016-03-07
regards
- -- tomás
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.12 (GNU/Linux)
iEYEARECAAYFAlcZLKMACgkQBcgs9XrR2kbW9wCfbXrqFKi0q8H4PZihI4hyObyg
SHkAn3zur28ELYGDnnOmdSJcEEAy4a2b
=im17
-----END PGP SIGNATURE-----