unicode
Alick Zhao edited this page Jan 19, 2016 · 4 revisions
Range: 0x000000–0x10FFFF (0x110000 code points in total)
BMP (Basic Multilingual Plane): 0x0000–0xFFFF
- UTF-8 variable length encoding (1–4 bytes)
- UTF-16 variable length encoding (2 or 4 bytes, i.e. 1–2 16-bit code units), descendant of deprecated UCS-2. Harmful?
- UTF-32 constant length encoding (4 bytes)
Specification (UTF-8): RFC 3629, which obsoletes RFC 2279
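The byte counts above can be checked directly in Python. This is only a sketch: `encoded_lengths` and `surrogate_pair` are hypothetical helper names, not part of any library, and the little-endian codecs are used so that no byte order mark is counted.

```python
def encoded_lengths(ch: str) -> dict:
    """Return how many bytes one character takes in each encoding."""
    return {
        "utf-8": len(ch.encode("utf-8")),
        "utf-16": len(ch.encode("utf-16-le")),  # "-le" variant emits no BOM
        "utf-32": len(ch.encode("utf-32-le")),  # "-le" variant emits no BOM
    }


def surrogate_pair(cp: int) -> tuple:
    """Split a code point outside the BMP into its UTF-16 surrogate pair."""
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("code point is inside the BMP or out of range")
    v = cp - 0x10000                    # 20-bit offset past the BMP
    high = 0xD800 + (v >> 10)           # top 10 bits
    low = 0xDC00 + (v & 0x3FF)          # bottom 10 bits
    return high, low


if __name__ == "__main__":
    for ch in ["A", "·", "☑", "😷"]:
        print(f"U+{ord(ch):04X}", encoded_lengths(ch))
    high, low = surrogate_pair(ord("😷"))
    print(f"U+1F637 as UTF-16 code units: {high:04X} {low:04X}")
```

Characters inside the BMP fit in one UTF-16 code unit (2 bytes), while anything above 0xFFFF, such as 😷, needs a surrogate pair (4 bytes).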
E.g.: 😷
- ☐ U+2610 BALLOT BOX
- ☑ U+2611 BALLOT BOX WITH CHECK
- ☒ U+2612 BALLOT BOX WITH X
My avatar: 🈚 U+1F21A
- · U+00B7 MIDDLE DOT, used as the interpunct in Chinese (e.g., to separate the parts of transliterated foreign names)
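Code points and names like the ones listed above can be looked up with Python's standard `unicodedata` module. A minimal sketch, where `describe` is a hypothetical helper name and the names returned depend on the Unicode database bundled with the interpreter:

```python
import unicodedata


def describe(ch: str) -> str:
    """Format one character as 'U+XXXX NAME', like the entries in this page."""
    name = unicodedata.name(ch, "<unnamed>")  # fall back if no name is known
    return f"U+{ord(ch):04X} {name}"


if __name__ == "__main__":
    for ch in "☐☑☒·":
        print(ch, describe(ch))
```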
See Test charts for tonal pinyin in Unicode Web pages.
Good further reading: Tom Christiansen's gbu slides from OSCON.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 Unported License.