varietieshas
Varietieshas is a term used in linguistic typology and data modeling to denote the presence of a particular feature across a set of language varieties. In data schemas, varietieshas can function as a predicate that summarizes feature distribution among varieties, enabling comparisons and statistical analysis. The concept emphasizes cross-variety coverage rather than analysis of any single language.
Formally, varietieshas(F, V, d) is considered true when the proportion of varieties in V that realize feature
Common applications include typological surveys, feature matrices in linguistic databases, and input for machine learning models
Limitations include dependency on data quality, feature definitions, and sample composition. The choice of threshold τ and
See also: linguistic universals, areal features, feature matrices, language typology.