Co-training

Co-training is a semi-supervised machine learning technique in which two or more classifiers are trained on different, conditionally independent views of the same data and iteratively teach each other using unlabeled data. The approach aims to improve learning when labeled data are scarce by exploiting abundant unlabeled examples.

Typically, the feature space is split into two views that are each predictive of the class on their own, and together are sufficient to determine the label under the underlying distribution. The method was introduced by Blum and Mitchell in 1998 as a theoretical framework for semi-supervised learning with multiple views.

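As an illustration of such a split (a toy sketch, not taken from the Blum and Mitchell paper), the two views can simply be disjoint column groups of a feature matrix; which columns form a coherent view, for example page-text features versus link-text features for web pages, is a modelling assumption:

    # Toy sketch: build two views as disjoint column groups of a feature matrix.
    # The data and the split below are arbitrary; in practice each view should
    # be informative about the class on its own.
    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.random((200, 20))   # 200 examples, 20 features (synthetic)
    X1 = X[:, :10]              # view 1: first 10 columns (assumed split)
    X2 = X[:, 10:]              # view 2: remaining 10 columns (assumed split)
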
Algorithm: start with a small labeled set and train a separate classifier on each view. Apply the classifiers to a pool of unlabeled instances and select the most confident predictions from each view. Add these newly labeled instances to the labeled pool, so that each classifier also trains on examples pseudo-labeled by the opposite view. Repeat until a stopping criterion is met.

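A minimal sketch of this loop, assuming the two views are NumPy arrays X1 and X2, y is a label array with placeholder values at unlabeled positions, labeled and unlabeled are index collections, and scikit-learn logistic regression stands in for the per-view learners; the function name and the rounds, k, and threshold parameters are illustrative choices rather than part of the original formulation:

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    def cotrain(X1, X2, y, labeled, unlabeled, rounds=10, k=5, threshold=0.8):
        """Sketch of two-view co-training; writes pseudo-labels into a copy of y."""
        labeled, unlabeled, y = list(labeled), set(unlabeled), y.copy()
        for _ in range(rounds):
            if not unlabeled:
                break
            # Train one classifier per view on the current labeled pool.
            clf1 = LogisticRegression(max_iter=1000).fit(X1[labeled], y[labeled])
            clf2 = LogisticRegression(max_iter=1000).fit(X2[labeled], y[labeled])
            pool = np.array(sorted(unlabeled))
            new_indices = []
            for clf, X in ((clf1, X1), (clf2, X2)):
                proba = clf.predict_proba(X[pool])
                conf = proba.max(axis=1)
                # Take at most k predictions per view, and only confident ones.
                for i in np.argsort(-conf)[:k]:
                    if conf[i] >= threshold:
                        idx = pool[i]
                        # Pseudo-label from this view (if both views pick the same
                        # instance, the second view's label wins in this sketch).
                        y[idx] = clf.classes_[proba[i].argmax()]
                        new_indices.append(idx)
            if not new_indices:
                break  # stopping criterion: no confident predictions left
            # Move newly pseudo-labeled instances into the shared labeled pool.
            for idx in new_indices:
                if idx in unlabeled:
                    unlabeled.discard(idx)
                    labeled.append(idx)
        # Final per-view classifiers trained on the enlarged labeled pool.
        clf1 = LogisticRegression(max_iter=1000).fit(X1[labeled], y[labeled])
        clf2 = LogisticRegression(max_iter=1000).fit(X2[labeled], y[labeled])
        return clf1, clf2

With the toy views X1 and X2 above, a call such as cotrain(X1, X2, y, labeled, unlabeled) would run the loop once y and the index sets have been prepared; all of these names are placeholders.
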
Assumptions and considerations: co-training relies on two main conditions: (1) each view is sufficient for predicting the label, and (2) the views are conditionally independent given the class label. In practice, these assumptions may be violated, which can lead to error propagation. Variants relax the independence requirement, or use more than two views, co-regularization, or agreement-based criteria.

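Written out (a standard statement of the assumption, not taken verbatim from this page), condition (2) says that for views x1, x2 and class label y:

    P(x1, x2 | y) = P(x1 | y) * P(x2 | y)
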
Applications include text classification, web page categorization, image recognition with multi-modal features, and bioinformatics. Limitations include the need for valid, distinct views, sensitivity to the initial labeled set, and potential amplification of labeling errors. Related approaches include self-training, tri-training, and multi-view learning.
