mittetähistatud
Mittetähistatud is an Estonian term meaning "unlabeled" or "not marked." It is commonly used in data science, machine learning, and linguistics to describe data that lacks explicit annotations such as class labels, tags, or segmentation boundaries. The opposite term tähistatud denotes data that has been annotated or labeled. Mittetähistatud data can include text corpora, images, audio, or video and is often abundant compared with labeled data.
In natural language processing and computer vision, mittetähistatud data is used for self-supervised and unsupervised learning.
Challenges and strategies: Because there is no ground-truth labeling, evaluating models trained on mittetähistatud data relies
See also: Labeled data, Annotation, Semi-supervised learning, Unsupervised learning, Pseudo-labeling, Self-supervised learning.