semiüberwachte
Semiüberwachte, in German-language literature often written as semi-supervised learning, refers to learning paradigms that use a small amount of labeled data together with a large amount of unlabeled data. The aim is to improve generalization when labeling is costly or impractical by exploiting the structure present in the unlabeled data. It sits between supervised learning (labeled data only) and unsupervised learning (unlabeled data only).
Common approaches include self-training or pseudo-labeling, where a model trained on labeled data assigns labels to
Key assumptions behind semiüberwachte methods include the cluster assumption (points in the same cluster share a
Applications span image and video recognition, natural language processing, and speech analysis, especially where labeling is