What's that Unicode character in my clipboard?

Question

Is there a quick and easy way to find the Unicode code point for any character? For example, I see a funny character on a web page, or a PDF file, or some other document.

What I current do is copy the character to the clipboard, save it to a file, and look at the file with a hex viewer. Alternatively I can open Microsoft Word, paste and do Alt+X. Both of these methods are a bit cumbersome. Is there an easier way?

I use Notepad++ so if there's any way to do that with Notepad++, it would be a suitable answer (it's less cumbersome than having to open Word). Or maybe there's a way to do it with a small specialised application?

score 35 · Accepted Answer · answered Dec 15 '17 at 12:07

35

I work a lot with Unicode characters, so I have written a small Windows application specifically for this:

Unicode Character Informer (Documentation)

In addition, my text editor, Rejbrand Text Editor, has extensive Unicode character support.

answered Dec 15 '17 at 12:07

Andreas Rejbrand

865

score 34 · Answer 2 · answered Dec 15 '17 at 10:34

Notepad++ has a pre-installed plug-in called Converter that has a option to Convert ASCII to HEX and Vice-versa. This tool is quite useful as to convert data files that are in HEX format which are to be converted to ASCII to read:

That is how it works:

score 28 · Answer 3 · answered Dec 15 '17 at 14:28

28

There's a nice little website called Unicode Character Inspector (built by Tim Whitlock) that does just that. I find it way more convenient than a text editor or desktop program.

answered Dec 15 '17 at 14:28

Baptiste Candellier

406

score 18 · Answer 4 · answered Dec 15 '17 at 21:57

When I'm faced with this problem, a quick Google search usually provides a quick answer. For example, when I google " unicode", I get a result like this:

I like this method because:

It works on any computer with internet
You don't have to install anything
The keypresses required (Ctrl+C & Ctrl+T & Ctrl+V & Enter) are muscle memory actions for me, and probably for most other developers/typists.

score 9 · Answer 5 · answered Dec 15 '17 at 21:44

On a Unix-like system*:

unicode -s "$(xsel -ob)"

You can alias this or create a script to run it.

The output looks like this:

U+2672 UNIVERSAL RECYCLING SYMBOL
UTF-8: e2 99 b2 UTF-16BE: 2672 Decimal: &#9842; Octal: \023162
♲ (♲)
Uppercase: 2672
Category: So (Symbol, Other)
Bidi: ON (Other Neutrals)

* It looks like the original poster is probably using Windows, but (a) this isn't specified, and (b) this solution might help others.

score 8 · Answer 6 · answered Dec 15 '17 at 22:36

I find Rishard Ishida's Unicode code converter (github link) very usefull for finding unicode charactercodes, amongst other things. It also provides translations/conversions to other codepoints, encodings and for instance escapes-sequences.

You may also want to checkout Richard Ishida's main webpage (rishida.net), as it contains (links to) alot of valuable tools and information, especially if you're interested in internationalisation and character-encoding. For instance, another very useful tool linked there, is his Uniview tool (github link).

And finally, also very useful i find, although mostly relevant to Mac-users, is macOS's Character Viewer, accessible through the Input Menu, which can be enabled in System Preferences → Keyboard

Although the Apple-support website mainly focusses on how-to insert emojies (…), the Character Viewer is actually very useful for looking-up specific ('special') characters and their codepoints in several different encodings, as well as for finding which fonts on your systen contain specific glyphs.

Cheers!

score 6 · Answer 7 · answered Dec 16 '17 at 18:01

You can use PowerShell!

[char]::ConvertToUtf32((gcb), 0)

This prints the first Unicode code point of the text on the clipboard.

If you don't have to worry about characters outside the Basic Multilingual Plane (that would be represented in .NET strings as a high and low surrogate), you can use this instead:

[int](gcb)[0]

If you'd prefer it in hex, you can use a format specifier:

'0x{0:x}' -f [char]::ConvertToUtf32((gcb), 0)

score 6 · Answer 8 · answered Dec 17 '17 at 10:17

A note for any Emacs users: you can type C-u C-x = and it will give you a bunch of information about the character under the cursor, including the Unicode code point, the name in the Unicode database and the categories etc.

             position: 146 of 147 (99%), column: 0
            character: ♲ (displayed as ♲) (codepoint 9842, #o23162, #x2672)
    preferred charset: unicode (Unicode (ISO10646))
code point in charset: 0x2672
               script: symbol
               syntax: w    which means: word
             category: .:Base
             to input: type "C-x 8 RET 2672" or "C-x 8 RET UNIVERSAL RECYCLING SYMBOL"
          buffer code: #xE2 #x99 #xB2
            file code: #xE2 #x99 #xB2 (encoded by coding system utf-8-unix)
              display: by this font (glyph code)
    xft:-PfEd-Mensch-normal-normal-normal-*-16-*-*-*-m-0-iso10646-1 (#x985)

Character code properties: customize what to show
  name: UNIVERSAL RECYCLING SYMBOL
  general-category: So (Symbol, Other)
  decomposition: (9842) ('♲')

score 4 · Answer 9 · edited Oct 31 '23 at 18:58

4

I use http://unicode.scarfboy.com, which is simple and works well.

One thing this website supports is looking up a specific entered Unicode character. If you paste the character from the clipboard and hit enter, it will identify the character.

edited Oct 31 '23 at 18:58

M. Justin

203

answered Dec 15 '17 at 18:45

IridescenceDeep

141

score 4 · Answer 10 · answered Dec 17 '17 at 13:04

4

You can also use the following site: https://unicode-table.com/en/ Just paste your character, and you'll get a Unicode code point and HTML code as well.

answered Dec 17 '17 at 13:04

Alina Ladygina

41

score 4 · Answer 11 · answered Dec 17 '17 at 17:04

4

Got Vim? Just paste it in, put your cursor on it, and hit ga. I use this all the time for weird characters.

answered Dec 17 '17 at 17:04

SilverWolf

221

DodgyCodeException · Answer 12 · 2017-12-18T14:20:14.183

2

Here's one more answer using an idea from user202729:

Bookmark the URL javascript:alert(prompt().codePointAt(0).toString(16)) and use a browser to run it. (Works on Chrome and Firefox. Doesn't appear to work on IE but this may be due to security settings.)

Unlike other answers, no internet connection is required, no external utility to download, not OS-specific.

edited Dec 18 '17 at 14:20

answered Dec 18 '17 at 14:14

DodgyCodeException

855

score 2 · Answer 13 · answered Apr 29 '25 at 19:26

I can't believe nobody has suggested this lookup gem yet as its what many of us really want: https://util.unicode.org/UnicodeJsps/character.jsp?a=0002

It includes ALL Unicode characters, all Unicode data on them, pictures of them for browsers without font support, and is constantly updated to the latest Unicode standard as its an official tool by the Unicode consortium.

What's that Unicode character in my clipboard?

15 Answers15