characterset
A characterset, also written as character set, is a defined collection of characters that a system can recognize, display, or process. Each character in the set is associated with a code point, a numeric value used by software to identify the character.
A characterset is distinct from a character encoding. The encoding is the method by which code points are converted into bytes for storage or transmission.
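The difference between a code point and its encoded form can be sketched in a few lines of Python. The helper name `describe` and the sample character are illustrative choices, not part of any standard.

```python
def describe(ch: str) -> dict:
    """Return the code point of ch and its bytes under two encodings."""
    return {
        "code_point": ord(ch),             # numeric value assigned by the character set
        "utf8": ch.encode("utf-8"),        # byte sequence under UTF-8
        "utf16le": ch.encode("utf-16-le"), # byte sequence under UTF-16 (little-endian)
    }

# One character, one code point, but different bytes per encoding.
info = describe("é")
print(hex(info["code_point"]), info["utf8"], info["utf16le"])
```

The code point stays the same in both cases; only the byte representation changes with the encoding.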
Common examples include ASCII, a 7-bit set of 128 characters; and the ISO/IEC 8859 series and Windows-1252, which extend ASCII to 8 bits with additional characters.
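As a small illustration of how these 8-bit sets relate to ASCII and to each other, the sketch below decodes the same bytes with Python's built-in `ascii`, `iso-8859-1`, and `cp1252` codecs. The sample bytes are arbitrary choices for demonstration.

```python
# Bytes below 0x80 mean the same thing in ASCII and in both 8-bit sets.
ascii_bytes = b"cafe"
same_everywhere = {
    codec: ascii_bytes.decode(codec)
    for codec in ("ascii", "iso-8859-1", "cp1252")
}

# Byte 0xE9 is outside ASCII but is the same letter in both 8-bit sets.
e_acute_latin1 = bytes([0xE9]).decode("iso-8859-1")
e_acute_cp1252 = bytes([0xE9]).decode("cp1252")

# Byte 0x80 differs: a control character in ISO 8859-1, the euro sign in Windows-1252.
x80_latin1 = bytes([0x80]).decode("iso-8859-1")
x80_cp1252 = bytes([0x80]).decode("cp1252")
```

This is why text decoded with the wrong 8-bit set often looks mostly correct, with only a few characters garbled.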
UTF-8 is the most widely used encoding on the web and is backward compatible with ASCII, using one to four bytes per code point.
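The ASCII compatibility of UTF-8 can be checked directly. The following is a minimal sketch with arbitrarily chosen sample characters; it compares the ASCII and UTF-8 bytes of a plain string and measures how many bytes UTF-8 needs for characters from different ranges.

```python
# Pure ASCII text produces identical bytes under ASCII and UTF-8.
text = "Hello"
ascii_compatible = text.encode("ascii") == text.encode("utf-8")

# UTF-8 is variable-width: characters further from the ASCII range need more bytes.
samples = ["A", "é", "€", "😀"]
byte_lengths = {ch: len(ch.encode("utf-8")) for ch in samples}

for ch, n in byte_lengths.items():
    print(f"U+{ord(ch):04X} -> {n} byte(s)")
```

Because every ASCII byte keeps its meaning, software that only handles ASCII can usually pass UTF-8 text through unchanged as long as it does not alter bytes with the high bit set.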
In modern computing, Unicode is commonly adopted as the character set, with UTF-8 as the preferred encoding.