help-gnu-emacs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: How to get the script name symbols of a specific character?


From: YE Qianchuan
Subject: Re: How to get the script name symbols of a specific character?
Date: Mon, 11 Feb 2013 23:17:58 +0800
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/20130109 Thunderbird/17.0.2

On 02/11/2013 11:07 PM, YE Qianchuan wrote:
On 02/11/2013 07:34 PM, Jambunathan K wrote:
Put your cursor on the box and type
         C-u C-x =
In fact, it's the same as `describe-char'. This command invokes
`what-cursor-position', which invokes `describe-char' eventually.

It will give more useful pointers.  The codepoint of a particular
character.  The name of the character, in the example below is prefixed
by the script it comes from etc.
Cool, I didn't notice its name may be prefixed by its script. It does make a lot sense.

However sadly, not all characters do so. For example, a CJK character has prefix CJK. But cjk is not a script name (though there's a script called cjk-misc) and it should belong
to `han'.

What's worse is, some characters don't show their names at all, even if I assign a font to it.

For example:
             position: 806 of 1031 (78%), column: 1
character: 😀 (displayed as 😀) (codepoint 128512, #o373000, #x1f600)
    preferred charset: unicode (Unicode (ISO10646))
code point in charset: 0x1F600
               syntax: w     which means: word
             category: L:Left-to-right (strong)
          buffer code: #xF0 #x9F #x98 #x80
file code: #xF0 #x9F #x98 #x80 (encoded by coding system utf-8-unix)
              display: no font available

Character code properties: customize what to show
  general-category: Cn (Other, Not Assigned)
  decomposition: (128512) ('😀')

Additional example, this character's name don't show any connection to its script.
How do you get its script symbol?

             position: 870 of 1031 (84%), column: 65
character: 😠 (displayed as 😠) (codepoint 128544, #o373040, #x1f620)
    preferred charset: unicode (Unicode (ISO10646))
code point in charset: 0x1F620
               syntax: w     which means: word
             category: .:Base
          buffer code: #xF0 #x9F #x98 #xA0
            file code: #xF0 #x9F #x98 #xA0
               (encoded by coding system utf-8-unix)
              display: by this font (glyph code)
xft:-unknown-Symbola-normal-normal-semi-condensed-*-15-*-*-*-*-0-iso10646-1 (#x1ADE)

Character code properties: customize what to show
  name: ANGRY FACE
  general-category: So (Symbol, Other)
  decomposition: (128544) ('😠')




reply via email to

[Prev in Thread] Current Thread [Next in Thread]