ristivalidointi
Ristivalidointi, or cross-validation, is a statistical resampling method used to assess how a predictive model generalizes to an independent dataset. The approach partitions the available data into training and validation sets multiple times and aggregates the resulting performance measures to obtain a more reliable estimate than a single train-test split.
The most common form is k-fold cross-validation. The data are split into k folds of roughly equal
Variants include stratified cross-validation, which preserves class proportions in each fold and is useful for imbalanced
Advantages include efficient use of data, reduced variance in the estimated performance, and broad applicability to
Ristivalidointi is widely used to compare models, select hyperparameters, and report performance estimates in machine learning