casefolds - Infinite Lexicon - Infinite Lexicon

casefolds

Casefolds refer to the process of converting text to a form suitable for case-insensitive comparison by applying Unicode case folding rules. Case folding is designed to be language- and locale-independent, enabling consistent caseless matching across scripts and alphabets. It is defined in the Unicode Standard as a form of mapping from code points to a caseless representation, and includes not only simple lowercase mappings but also expansions where a single character becomes multiple letters, as well as ligature expansions.

In practice, casefolding often differs from simple lowercasing. For example, the German character ß lowercases to

Implementation and usage: Many programming environments provide a casefold function or method. In Python, strings expose

Relationship to normalization: Case folding is distinct from Unicode normalization; both may be applied, but for

“ﬁ”

a

case-insensitive

“ß”.casefold()

->

a

a

transformation.