idiomil
Idiomil is a hypothetical linguistic concept used to quantify the idiomaticity of multiword expressions in natural language. The term is not part of an established metric, but appears in discussions of figurative language and computational modeling as a way to compare how entrenched or conventionalized an expression is across languages or corpora.
Definition and scope: An idiomil is defined as a dimensionless score on a 0 to 1 scale.
Measurement: Proposals for estimating idiomil combine corpus-based statistics, human judgments, and model-based estimates. Common approaches include
Applications: Idiomil concepts are used to evaluate language models on idiom handling, to study cross-linguistic variation
Limitations: The concept faces challenges such as subjectivity in judgments, dependence on context, domain and register
See also: Idiom, multiword expression, figurative language, compositionality, lexicalized expression.