validatieset
A validatieset, in machine learning and data science, is a subset of labeled data used during model development to estimate the model’s performance and to guide decisions about model selection and hyperparameter tuning. It sits between the training data and the final evaluation data, helping to assess how well a model trained on the training set generalizes to unseen data.
In practice, the available data are often split into three parts: training, validation, and test. The model
Key considerations include avoiding data leakage between the sets, ensuring that splits respect the underlying data