Katseandmed
Katseandmed, Estonian for test data, denote the portion of a dataset reserved for evaluating a model, algorithm, or hypothesis after training. Unlike training data, katseandmed are not used to fit the model; they provide an unbiased estimate of performance in deployment conditions.
Typically, data projects split data into training, validation, and test sets. The training set fits the model,
To be effective, katseandmed should reflect the deployment context: representative of the target population or input
Common evaluation metrics on katseandmed depend on the task: classification uses accuracy, precision, recall, F1, and
Ethical and methodological considerations include privacy-preserving handling, de-identification, and attention to biases in the test set.