Spelformer
Spelformer is a term used in computational linguistics to describe a class of tools and algorithms that generate alternative orthographic forms for a given base word or lemma. A spelformer takes a canonical spelling as input and outputs a set of spelling variants that may be appropriate in a given language, dialect, or historical period. The goal is to capture the range of legitimate spellings that can appear in text, documents, or user queries.
Spelformers can be constructed using rule-based methods, statistical models, or neural approaches, often combining phonological-to-orthographic mappings
Applications include historical text digitization and restoration, optical character recognition post-processing, search and information retrieval, linguistic
Examples in English include variants such as colour versus color, theatre versus theater, and analyse versus
Limitations include overgeneration of unlikely forms, dependence on high-quality linguistic resources, and potential confusion when used