spellingtophoneme
Spelling-to-phoneme, commonly known as grapheme-to-phoneme (G2P) conversion, is the process of converting a sequence of written letters (graphemes) into a corresponding sequence of phonemes, the units of sound in speech. The output may include stress marks and syllable boundaries. This task is foundational for many speech technologies, pronunciation lexicons, and language learning tools. In English, spelling-to-phoneme is challenging due to irregular spellings and multiple pronunciations for some spellings.
Languages differ in difficulty for spelling-to-phoneme mapping. Shallow orthographies yield relatively consistent grapheme-to-phoneme correspondences, while deep
Approaches to spelling-to-phoneme include rule-based systems that encode linguistic knowledge, data-driven methods that learn mappings from
Challenges include irregular pronunciations, homographs, allophony, and dialectal variation. Ongoing work aims to improve coverage, cross-language