percharacter

Percharacter, often written as per-character or character-level, is a term used in computing, linguistics, and typography to denote operations, metrics, or models that operate at the level of individual characters rather than words or subword units.

In text processing and OCR/ASR evaluation, per-character metrics include character error rate (CER) and character-level accuracy.
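As an illustration, character error rate is commonly computed as the Levenshtein (edit) distance between a reference string and a hypothesis string, divided by the length of the reference. The sketch below assumes that definition; the function names `edit_distance` and `character_error_rate` are hypothetical and not taken from any particular toolkit.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty string
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of r
                curr[j - 1] + 1,     # insertion of h
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def character_error_rate(ref: str, hyp: str) -> float:
    """Edit distance divided by reference length (assumed CER definition)."""
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)
```

For example, `character_error_rate("hello", "hallo")` gives 0.2, since one substitution is needed across five reference characters. Under this definition CER can exceed 1.0 when the hypothesis is much longer than the reference.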

Character-level models process text as sequences of characters. They can capture spelling and long-range dependencies.
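A minimal sketch of how text might be turned into a character sequence for such a model is shown here. It assumes a vocabulary built from the characters seen in the input; the helper names `build_vocab`, `encode`, and `decode` are hypothetical.

```python
def build_vocab(text: str) -> dict[str, int]:
    """Map each distinct character to an integer id, in sorted order."""
    return {ch: i for i, ch in enumerate(sorted(set(text)))}


def encode(text: str, vocab: dict[str, int]) -> list[int]:
    """Convert a string to a list of character ids (one id per character)."""
    return [vocab[ch] for ch in text]


def decode(ids: list[int], vocab: dict[str, int]) -> str:
    """Invert encode() by mapping ids back to characters."""
    inverse = {i: ch for ch, i in vocab.items()}
    return "".join(inverse[i] for i in ids)
```

With `vocab = build_vocab("hello")`, calling `encode("hello", vocab)` returns `[1, 0, 2, 2, 3]`: one id per character, so the sequence is as long as the string itself. Characters absent from the vocabulary raise a `KeyError` in this sketch; a practical system would need some fallback for them.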

In typography and fonts, per-character information concerns glyph metrics, kerning, and rendering of individual characters.

Limitations include longer sequence lengths and data sparsity for rare characters, as well as Unicode encoding issues.
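One way Unicode can complicate per-character counting is that the same visible text may be stored as different code point sequences, and its length differs again when measured in encoded bytes. The sketch below uses Python's standard `unicodedata` module; the helper name `char_counts` is hypothetical.

```python
import unicodedata


def char_counts(text: str) -> dict[str, int]:
    """Report the length of a string under several notions of 'character'."""
    return {
        # Code points exactly as stored in the string.
        "code_points": len(text),
        # Code points after canonical composition (NFC).
        "nfc_code_points": len(unicodedata.normalize("NFC", text)),
        # Code points after canonical decomposition (NFD).
        "nfd_code_points": len(unicodedata.normalize("NFD", text)),
        # Bytes in the UTF-8 encoding of the stored string.
        "utf8_bytes": len(text.encode("utf-8")),
    }


composed = "caf\u00e9"                                # 'é' as one code point
decomposed = unicodedata.normalize("NFD", composed)   # 'e' plus combining accent
```

Here `composed` has 4 code points and `decomposed` has 5, although both render as the same word, so a per-character metric or model generally needs to fix one normalization form before comparing or counting.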
