labelvary
Labelvary is a term used in data annotation and machine learning to describe the variability observed in label assignments across annotators, contexts, or models. It captures how consistently items are labeled and how the chosen labels are distributed among available categories. While not a formal standard in the literature, labelvary is used to discuss label uncertainty and label noise beyond single-annotator judgments.
Per-item labelvary can be quantified in several ways. One common approach is to compute the entropy of
Labelvary has practical applications in data curation and quality control, helping teams target items for review
Limitations and considerations accompany labelvary. It is sensitive to context, label taxonomy, and annotator instructions, so