MLStest
MLStest is a software toolkit designed to enable rigorous evaluation of machine learning models through standardized experimentation and statistical testing. It provides a framework to plan, execute, and compare model performance across datasets and experimental configurations, with an emphasis on reproducibility and statistical validity.
The toolkit offers evaluation metrics for classification and regression, supports common statistical tests (paired and unpaired),
Its architecture is modular, comprising an experiment manager, metric calculators, a statistical testing engine, data adapters,