Codepoint - Infinite Lexicon

Codepoint

A code point is a numeric value that uniquely identifies a character in a character set or encoding system. In Unicode, code points are abstract identifiers for characters and are typically written with the prefix U+ followed by four to six hexadecimal digits. For example, U+0041 designates the Latin capital letter A, and U+1F600 designates the grinning face emoji. A code point is neither the character itself nor a specific byte sequence; it is an abstract reference that software uses to identify a character.
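The U+ notation can be illustrated with a minimal Python sketch. It relies on the built-in `ord` and `chr`, which convert between a one-character string and its code point; the helper names `format_code_point` and `from_code_point` are hypothetical and chosen only for this example.

```python
def format_code_point(ch: str) -> str:
    """Return the U+ notation for a one-character string."""
    # ord() yields the integer code point; pad to at least four hex digits.
    return f"U+{ord(ch):04X}"


def from_code_point(cp: int) -> str:
    """Return the character identified by an integer code point."""
    return chr(cp)


print(format_code_point("A"))           # U+0041
print(format_code_point("\U0001F600"))  # U+1F600
print(from_code_point(0x41))            # A
```

The integer and the U+ string are two spellings of the same abstract identifier; neither says anything yet about how the character is stored as bytes.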

Code points are mapped to bytes by encodings. Unicode defines several encoding forms, such as UTF-8, UTF-16, and UTF-32.
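As a rough illustration of how one code point becomes different byte sequences, the sketch below encodes a character with Python's standard codecs for UTF-8 and UTF-16, with UTF-32 added for comparison. The function name `encoded_bytes` is hypothetical, and the big-endian codec variants are chosen here only so that no byte order mark appears in the output.

```python
def encoded_bytes(cp: int) -> dict:
    """Map each encoding name to the hex bytes it produces for one code point."""
    ch = chr(cp)
    # Big-endian variants are an assumption of this sketch; they avoid a BOM.
    names = ("utf-8", "utf-16-be", "utf-32-be")
    return {name: ch.encode(name).hex(" ") for name in names}


print(encoded_bytes(0x41))
# {'utf-8': '41', 'utf-16-be': '00 41', 'utf-32-be': '00 00 00 41'}
print(encoded_bytes(0x1F600))
# {'utf-8': 'f0 9f 98 80', 'utf-16-be': 'd8 3d de 00', 'utf-32-be': '00 01 f6 00'}
```

The code point stays the same in every case; only the byte representation changes with the encoding.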

Unicode assigns code points in the range U+0000 to U+10FFFF, organized into planes, with the Basic Multilingual Plane (BMP) covering U+0000 to U+FFFF.
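Because each plane spans 65,536 (0x10000) code points, the plane number is the code point shifted right by 16 bits. The following sketch shows that arithmetic; `plane_of` and `MAX_CODE_POINT` are hypothetical names introduced for the example.

```python
MAX_CODE_POINT = 0x10FFFF  # upper bound of the Unicode code space


def plane_of(cp: int) -> int:
    """Return the plane number (0 to 16) that contains a code point."""
    if not 0 <= cp <= MAX_CODE_POINT:
        raise ValueError(f"not a Unicode code point: {cp:#x}")
    # Each plane holds 0x10000 code points, so the plane is the high bits.
    return cp >> 16


print(plane_of(0x0041))    # 0  (Basic Multilingual Plane)
print(plane_of(0x1F600))   # 1
print(plane_of(0x10FFFF))  # 16
```

Plane 0 is the Basic Multilingual Plane, so any code point above U+FFFF lies outside it.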

Code points are distinct from glyphs, the shapes produced by fonts. A single code point may render differently depending on the font.
