Unicode#
Table of Contents#
Symbol |
HTML Entity |
Unicode Code Point |
Unicode Name |
---|---|---|---|
← |
|
U+2190 |
LEFTWARDS ARROW |
↑ |
|
U+2191 |
UPWARDS ARROW |
→ |
|
U+2192 |
RIGHTWARDS ARROW |
↓ |
|
U+2193 |
DOWNWARDS ARROW |
↩ |
|
U+21A9 |
LEFTWARDS ARROW WITH HOOK |
↵ |
|
U+21B5 |
DOWNWARDS ARROW WITH CORNER LEFTWARDS (carriage return symbol) |
↹ |
|
U+21B9 |
LEFTWARDS ARROW TO BAR OVER RIGHTWARDS ARROW TO BAR (tab with shift tab) |
⇆ |
|
U+21C6 |
LEFTWARDS ARROW OVER RIGHTWARDS ARROW |
⇤ |
|
U+21E4 |
LEFTWARDS ARROW TO BAR (leftward tab) |
⇥ |
|
U+21E5 |
RIGHTWARDS ARROW TO BAR (rightward tab) |
⇧ |
|
U+21E7 |
UPWARDS WHITE ARROW (shift) |
⇪ |
|
U+21EA |
UPWARDS WHITE ARROW FROM BAR (caps lock) |
⌃ |
|
U+2303 |
UP ARROWHEAD |
⌅ |
|
U+2305 |
PROJECTIVE |
⌘ |
|
U+2318 |
PLACE OF INTEREST SIGN (command key) |
⌤ |
|
U+2324 |
UP ARROWHEAD BETWEEN TWO HORIZONTAL BARS (enter key) |
⌥ |
|
U+2325 |
OPTION KEY |
⌫ |
|
U+232B |
ERASE TO THE LEFT (delete to the left key) |
⎇ |
|
U+2387 |
ALTERNATIVE KEY SYMBOL (alt key) |
⎋ |
|
U+238B |
BROKEN CIRCLE WITH NORTHWEST ARROW (escape) |
⏏ |
|
U+23CF |
EJECT SYMBOL |
|
|
U+F8FF |
How many Unicode code points are there?
0x10FFFF
1114111
cases = [chr(c) for c in list(range(8592, 9251)) + [
63743,
]] + [
'\u002D',
'\u2013',
'\u2014',
'\u2060', # replaces U+FEFF qua deprecated ZERO WIDTH NO-BREAK SPACE
'\uFEFF', # BOM
'\uFFEF', # BOM, byte-swapped (illegal sequence that may not appear in Unicode text)
#string.printable,
]
print_code_point_information(cases)
---------------------------------------------------------------------------
NameError Traceback (most recent call last)
Cell In[2], line 12
1 cases = [chr(c) for c in list(range(8592, 9251)) + [
2 63743,
3 ]] + [
(...)
10 #string.printable,
11 ]
---> 12 print_code_point_information(cases)
NameError: name 'print_code_point_information' is not defined
East Asian Languages#
2E80-2EFF CJK Radicals Supplement Unicode
2F00-2FDF Kangxi Radicals Unicode
2FE0-2FEF UNDEFINED
2FF0-2FFF Ideographic Description Characters Unicode
3000-303F CJK Symbols and Punctuation Unicode
3100-312F Bopomofo Unicode
3130-318F Hangul Compatibility Jamo Unicode
3190-319F Kanbun Unicode
31A0-31BF Bopomofo Extended Unicode
31C0-31EF CJK Strokes Unicode
3200-32FF Enclosed CJK Letters and Months Unicode
3300-33FF CJK Compatibility Unicode
3400-4DBF CJK Unified Ideographs Extension A Unicode
4DC0-
4E00-9FFF CJK Unified Ideographs Unicode
A000-
F900-FAFF CJK Compatibility Ideographs Unicode
FB00-
FE50-FE6F Small Form Variants Unicode
FE70-
FF00-FFEF Halfwidth and Fullwidth Forms Unicode
FFF0-
1AFF0-1AFFF Kana Extended B Unicode
1B000-1B0FF Kana Supplement Unicode
1B130-1B16F Small Kana Extension Unicode
20000-2A6DF CJK Unified Ideographs Extension B Unicode
CJK Radicals Supplement#
# start = 0x2E80
# stop = 0x2EFF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)
CJK Symbols and Punctuation#
# start = 0x3000
# stop = 0x303F
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)
Hiragana#
# start = 0x3040
# stop = 0x309F
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)
Katakana#
# start = 0x30A0
# stop = 0x30FF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)
Katakana Phonetic Extensions#
# start = 0x31F0
# stop = 0x31FF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)
Enclosed CJK Letters and Months#
# start = 0x3200
# stop = 0x32FF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)
CJK Compatibility#
# start = 0x3300
# stop = 0x33FF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)
Small Form Variants#
# start = 0xFE50
# stop = 0xFE6F
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)
Halfwidth and Fullwidth Forms#
# start = 0xFF00
# stop = 0xFFEF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)
CJK Unified Ideographs Extension A#
# start = 0x3400
# stop = 0x4DBF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)
CJK Unified Ideographs#
# start = 0x4E00
# stop = 0x9FFF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)
CJK Compatibility Ideographs#
# start = 0xF900
# stop = 0xFAFF
# points = [chr(c) for c in range(start, stop)]
# print_code_point_information(points)
Resources#
[ d ] Unicode Charts
Terms#
[ w ] Arrow