Keresztvalidáció
Keresztvalidáció, in English cross-validation, is a resampling procedure used to evaluate machine learning models on a limited data sample. It helps in assessing how well a model generalizes to an independent dataset. The most common form is k-fold cross-validation. In k-fold cross-validation, the original data is randomly partitioned into k equal-sized subsets, called folds. The model is then trained k times. Each time, one of the k folds is used as the test set, and the remaining k-1 folds are used as the training set. The performance metrics from these k runs are then averaged to provide an estimate of the model's performance on unseen data.
This method is crucial for preventing overfitting, a phenomenon where a model learns the training data too