Home

lowercasing

Lowercasing is the process of converting all alphabetic characters in a string to their lowercase forms. It is widely used to normalize text for case-insensitive comparison, indexing, search, and user input validation. In simple environments that use ASCII, lowercase conversion maps the uppercase letters A through Z to a-z while leaving non-letter characters untouched.

In Unicode, lowercasing relies on mappings that cover many scripts beyond Latin. Some mappings are not one-to-one

Practically, lowercasing is performed by applying a character mapping table to the input stream, often in a

or
are
locale-dependent.
For
example,
Turkish
has
I
and
İ
with
distinct
lowercase
forms:
the
uppercase
I
maps
to
ı
(dotless
i),
and
the
uppercase
İ
maps
to
i.
German
uses
the
letter
ß,
which
lowercases
to
ß;
historically
uppercase
forms
were
written
as
SS,
and
Unicode
has
introduced
a
capital
sharp
S
(ẞ)
that
lowercases
to
ß.
Implementations
must
choose
between
language-aware
rules
or
language-agnostic
ones;
many
libraries
offer
a
locale
option
or
provide
a
default
language-neutral
mapping.
There
is
also
the
concept
of
case
folding,
which
is
a
more
aggressive
form
used
for
caseless
comparisons
and
aims
to
be
language-insensitive,
potentially
mapping
multiple
code
points
to
a
common
form.
single
pass,
with
time
complexity
proportional
to
the
string
length.
It
is
a
standard
feature
in
programming
languages,
text-processing
libraries,
databases,
and
search
engines.
Correct
handling
of
multi-character
mappings
and
locale-specific
rules
is
important
for
accurate
text
processing
in
multilingual
contexts.