amongphrases
Amongphrases is a term used in linguistics to describe a framework or dataset that collects phrases recurring across languages and cultures, focusing on idioms, proverbs, and formulaic language. It aims to illuminate cross-linguistic regularities in meaning, form, and usage, and to support translation, language teaching, and natural language processing.
Origin and scope: The concept emerged from multilingual corpus projects in the early 21st century and has
Methodology: Researchers compile large multilingual corpora, annotate phrases by semantic domain, and apply cross-lingual alignment techniques
Classification and examples: Entries are often grouped by domain, such as time, weather, social interaction, or
Applications and criticism: Amongphrases informs machine translation, parallel corpora construction, and language education by providing cross-linguistic
See also: cross-linguistic formulaic language, idiom, collocation, language resources, natural language processing.