Unicodecodepunt
A Unicode code point is the basic unit of information in the Unicode character set. It is a numeric value that uniquely identifies a character or a formatting symbol, independent of how the character is stored or displayed. Code points are defined by the Unicode Standard and are platform-, language-, and encoding-agnostic.
Code points are written in the form U+XXXX, using hexadecimal digits. For example, U+0041 denotes the Latin
In actual data, a code point is encoded in one of several Unicode Transformation Formats, such as
A code point is distinct from its rendered glyph. A single code point can map to multiple
Unicode code points are managed by the Unicode Consortium and form the basis for text encoding in