Unicode encoding
Unicode encoding is the process of representing Unicode characters as sequences of bytes. Unicode is a universal character standard that aims to cover the characters of virtually every writing system, as well as symbols and emoji. It assigns each character an abstract number called a code point. Computers, however, store and transmit data as bytes (groups of eight bits), so a method is needed to translate code points into concrete byte sequences.
Several different Unicode encoding forms exist, each with its own rules for converting code points into bytes.
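As an illustration of how the same code point yields different bytes under different encodings, here is a minimal Python sketch. The helper name encode_all and the sample characters are chosen for this example only. The big-endian codec variants ("utf-16-be", "utf-32-be") are used so that no byte order mark is prepended to the output.

```python
def encode_all(text):
    """Return the bytes produced for `text` by three common Unicode encodings."""
    # The "-be" codecs emit big-endian code units without a byte order mark.
    return {name: text.encode(name) for name in ("utf-8", "utf-16-be", "utf-32-be")}


# Sample characters: Latin letter A, e with acute, euro sign, grinning face emoji.
for ch in ("A", "\u00e9", "\u20ac", "\U0001F600"):
    encoded = {name: data.hex(" ") for name, data in encode_all(ch).items()}
    print(f"U+{ord(ch):04X}", encoded)
```

Running the loop shows that the number of bytes per character depends on both the code point and the encoding chosen: the letter A takes one byte in UTF-8 but four in UTF-32, while the emoji takes four bytes in all three.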
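UTF-16 is also a variable-width encoding, using either two or four bytes per code point: code points in the Basic Multilingual Plane (U+0000 to U+FFFF) take a single 16-bit code unit, and all others take a pair of 16-bit code units known as a surrogate pair. It is widely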