Unicode#


Table of Contents#


Symbol

HTML Entity

Unicode Code Point

Unicode Name

←, ←, ←

U+2190

LEFTWARDS ARROW

↑, ↑, ↑

U+2191

UPWARDS ARROW

→, →, →

U+2192

RIGHTWARDS ARROW

↓, ↓, ↓

U+2193

DOWNWARDS ARROW

↩, ↩

U+21A9

LEFTWARDS ARROW WITH HOOK

↵, ↵, ↵

U+21B5

DOWNWARDS ARROW WITH CORNER LEFTWARDS (carriage return symbol)

↹, ↹

U+21B9

LEFTWARDS ARROW TO BAR OVER RIGHTWARDS ARROW TO BAR (tab with shift tab)

&#86??;, ⇆

U+21C6

LEFTWARDS ARROW OVER RIGHTWARDS ARROW

⇤, ⇤

U+21E4

LEFTWARDS ARROW TO BAR (leftward tab)

⇥, ⇥

U+21E5

RIGHTWARDS ARROW TO BAR (rightward tab)

⇧, ⇧

U+21E7

UPWARDS WHITE ARROW (shift)

⇪, ⇪

U+21EA

UPWARDS WHITE ARROW FROM BAR (caps lock)

⌃, ⌃

U+2303

UP ARROWHEAD

⌅, ⌅

U+2305

PROJECTIVE

⌘, ⌘

U+2318

PLACE OF INTEREST SIGN (command key)

⌤, ⌤

U+2324

UP ARROWHEAD BETWEEN TWO HORIZONTAL BARS (enter key)

⌥, ⌥

U+2325

OPTION KEY

⌫, ⌫

U+232B

ERASE TO THE LEFT (delete to the left key)

⎇, ⎇

U+2387

ALTERNATIVE KEY SYMBOL (alt key)

⎋, ⎋

U+238B

BROKEN CIRCLE WITH NORTHWEST ARROW (escape)

⏏, ⏏

U+23CF

EJECT SYMBOL

, 

U+F8FF


How many Unicode code points are there?

0x10FFFF
1114111
cases = [chr(c) for c in list(range(8592, 9251)) + [
  63743,
]] + [
  '\u002D',
  '\u2013',
  '\u2014',
  '\u2060', # replaces U+FEFF qua deprecated ZERO WIDTH NO-BREAK SPACE
  '\uFEFF', # BOM
  '\uFFEF', # BOM, byte-swapped (illegal sequence that may not appear in Unicode text)
  #string.printable,
]
print_code_point_information(cases)
---------------------------------------------------------------------------
NameError                                 Traceback (most recent call last)
Cell In[2], line 12
      1 cases = [chr(c) for c in list(range(8592, 9251)) + [
      2   63743,
      3 ]] + [
   (...)
     10   #string.printable,
     11 ]
---> 12 print_code_point_information(cases)

NameError: name 'print_code_point_information' is not defined

East Asian Languages#

  • 2E80-2EFF CJK Radicals Supplement Unicode

  • 2F00-2FDF Kangxi Radicals Unicode

  • 2FE0-2FEF UNDEFINED

  • 2FF0-2FFF Ideographic Description Characters Unicode

  • 3000-303F CJK Symbols and Punctuation Unicode

  • 3040-309F Hiragana Unicode Wiki

  • 30A0-30FF Katakana Unicode Wiki

  • 3100-312F Bopomofo Unicode

  • 3130-318F Hangul Compatibility Jamo Unicode

  • 3190-319F Kanbun Unicode

  • 31A0-31BF Bopomofo Extended Unicode

  • 31C0-31EF CJK Strokes Unicode

  • 31F0-31FF Katakana Phonetic Extensions Unicode Wiki

  • 3200-32FF Enclosed CJK Letters and Months Unicode

  • 3300-33FF CJK Compatibility Unicode

  • 3400-4DBF CJK Unified Ideographs Extension A Unicode

  • 4DC0-

  • 4E00-9FFF CJK Unified Ideographs Unicode

  • A000-

  • F900-FAFF CJK Compatibility Ideographs Unicode

  • FB00-

  • FE50-FE6F Small Form Variants Unicode

  • FE70-

  • FF00-FFEF Halfwidth and Fullwidth Forms Unicode

  • FFF0-

  • 1AFF0-1AFFF Kana Extended B Unicode

  • 1B000-1B0FF Kana Supplement Unicode

  • 1B130-1B16F Small Kana Extension Unicode

  • 20000-2A6DF CJK Unified Ideographs Extension B Unicode

CJK Radicals Supplement#

# start  = 0x2E80
# stop   = 0x2EFF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)

CJK Symbols and Punctuation#

# start  = 0x3000
# stop   = 0x303F
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)

Hiragana#

# start  = 0x3040
# stop   = 0x309F
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)

Katakana#

# start  = 0x30A0
# stop   = 0x30FF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)

Katakana Phonetic Extensions#

# start  = 0x31F0
# stop   = 0x31FF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)

Enclosed CJK Letters and Months#

# start  = 0x3200
# stop   = 0x32FF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)

CJK Compatibility#

# start  = 0x3300
# stop   = 0x33FF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)

Small Form Variants#

# start  = 0xFE50
# stop   = 0xFE6F
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)

Halfwidth and Fullwidth Forms#

# start  = 0xFF00
# stop   = 0xFFEF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)

CJK Unified Ideographs Extension A#

# start  = 0x3400
# stop   = 0x4DBF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)

CJK Unified Ideographs#

# start  = 0x4E00
# stop   = 0x9FFF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)

CJK Compatibility Ideographs#

# start  = 0xF900
# stop   = 0xFAFF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)

Resources#

  • [ d ] Unicode Charts


Terms#

  • [ w ] Arrow