Canonicalization
Canonicalization is the process of converting data into a standard or canonical form. The goal is to remove variation that is not essential to the meaning or function of the data, so that two representations that convey the same information can be treated as equivalent.
In computing and text processing, canonicalization includes Unicode normalization (such as NFC, NFD, NFKC, NFKD) to
In information retrieval and web technologies, canonicalization helps prevent duplicate content from diluting search relevance. Canonical
In data management, canonicalization maps diverse data into a canonical data model or schema. This supports
In security, improper canonicalization can create vulnerabilities such as path traversal or input bypass, where different
In mathematics and computer algebra, a canonical form is a standard representation that uniquely identifies an
See also: normalization, data normalization, Unicode normalization, URL normalization, deduplication.