0x0000–0x10FFFF
0x0000–0x10FFFF is the hexadecimal range spanning every Unicode code point, from U+0000 to U+10FFFF. The space contains 1,114,112 code points, organized into 17 planes of 65,536 code points each. The Basic Multilingual Plane (BMP) is Plane 0 (U+0000 to U+FFFF); Planes 1 through 16 (U+10000 to U+10FFFF) are the supplementary planes, which hold most emoji and many historic scripts.
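The plane arithmetic can be checked with a short sketch. The names `MAX_CODE_POINT`, `PLANE_SIZE`, and `plane_of` are illustrative and not part of any standard API; the only assumption is that a plane number is the code point divided by 0x10000, which follows from the plane boundaries given above.

```python
MAX_CODE_POINT = 0x10FFFF   # last code point in the Unicode code space
PLANE_SIZE = 0x10000        # 65,536 code points per plane


def plane_of(cp: int) -> int:
    """Return the plane number (0..16) that contains code point `cp`."""
    if not 0 <= cp <= MAX_CODE_POINT:
        raise ValueError(f"not a Unicode code point: {cp:#x}")
    # Dropping the low 16 bits leaves the plane index.
    return cp >> 16


total_code_points = MAX_CODE_POINT + 1
plane_count = total_code_points // PLANE_SIZE

print(total_code_points, plane_count)   # 1114112 17
print(plane_of(ord("A")))               # 0 (BMP)
print(plane_of(0x1F600))                # 1 (a supplementary plane)
```

The shift works because each plane covers exactly 2^16 code points, so the bits above the low 16 identify the plane.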
Code point distribution and planes
- BMP (Plane 0): U+0000..U+FFFF
- Supplementary planes (Planes 1–16): U+10000..U+10FFFF
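- UTF-8 uses 1 to 4 bytes per character. Code points up to U+7F use 1 byte; up to U+7FF, 2 bytes; up to U+FFFF, 3 bytes; and U+10000..U+10FFFF, 4 bytes.
- UTF-16 encodes code points U+0000..U+FFFF (except the surrogate range) as a single 16-bit unit; code points U+10000..U+10FFFF are encoded as a surrogate pair of two 16-bit units.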
- Surrogate code points U+D800–U+DFFF never represent characters on their own and are not Unicode scalar values; they are reserved for forming surrogate pairs in UTF-16.
- Noncharacters include U+FDD0..U+FDEF and all code points ending in FFFE or FFFF (across planes), such as U+FFFE, U+FFFF, U+1FFFE, and U+10FFFF.
- Private-use areas are available for custom definitions: BMP U+E000..U+F8FF; Plane 15 U+F0000..U+FFFFD; Plane 16 U+100000..U+10FFFD.
- Every Unicode character, including modern symbols, emoji, and historic scripts, is assigned a code point within this range.
- Some code points are reserved as noncharacters or designated for private use; others are assigned to
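The encoding widths and special ranges listed above can be sketched in a few functions. This is a minimal illustration, not a replacement for a Unicode library: the function names are hypothetical, the UTF-8 thresholds (U+7F, U+7FF, U+FFFF) and the surrogate-pair arithmetic follow the standard definitions of UTF-8 and UTF-16, and the ranges are the ones given in the list.

```python
def is_surrogate(cp: int) -> bool:
    """True for U+D800..U+DFFF, which are reserved for UTF-16 pairs."""
    return 0xD800 <= cp <= 0xDFFF


def utf8_length(cp: int) -> int:
    """Number of bytes UTF-8 needs for code point `cp`."""
    if not 0 <= cp <= 0x10FFFF or is_surrogate(cp):
        raise ValueError(f"not encodable: {cp:#x}")
    if cp <= 0x7F:
        return 1
    if cp <= 0x7FF:
        return 2
    if cp <= 0xFFFF:
        return 3
    return 4


def utf16_units(cp: int) -> list[int]:
    """16-bit code units UTF-16 uses for code point `cp`."""
    if not 0 <= cp <= 0x10FFFF or is_surrogate(cp):
        raise ValueError(f"not encodable: {cp:#x}")
    if cp <= 0xFFFF:
        return [cp]
    offset = cp - 0x10000                  # 20-bit value split into two halves
    high = 0xD800 | (offset >> 10)         # high (lead) surrogate
    low = 0xDC00 | (offset & 0x3FF)        # low (trail) surrogate
    return [high, low]


def is_noncharacter(cp: int) -> bool:
    """True for U+FDD0..U+FDEF and any code point ending in FFFE or FFFF."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def is_private_use(cp: int) -> bool:
    """True for the BMP, Plane 15, and Plane 16 private-use areas."""
    return (0xE000 <= cp <= 0xF8FF
            or 0xF0000 <= cp <= 0xFFFFD
            or 0x100000 <= cp <= 0x10FFFD)


for cp in (0x41, 0xE9, 0x20AC, 0x1F600):
    units = " ".join(f"{u:04X}" for u in utf16_units(cp))
    print(f"U+{cp:04X}: {utf8_length(cp)} UTF-8 byte(s), UTF-16 units {units}")
```

The results can be cross-checked against Python's built-in codecs, for example `len(chr(0x1F600).encode("utf-8"))` for the UTF-8 length.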