metatests

Metatests are evaluations of testing procedures themselves rather than of the results those procedures produce. In statistics and psychometrics, a metatest examines properties such as the Type I error rate, power, calibration, robustness to model misspecification, and measurement validity or reliability across a range of conditions. The term is sometimes used more broadly for assessments of software test suites or other quality-assurance checks, where what is judged is the adequacy of the tests rather than the subject data.

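As an illustration, the Type I error rate and power of a one-sample t-test can be estimated by Monte Carlo simulation: data are generated repeatedly under the null hypothesis and under a fixed alternative, and the proportion of rejections is recorded in each case. The following is a minimal sketch in Python using NumPy and SciPy; the sample size, effect size, significance level, and number of replications are illustrative assumptions, not part of any standard.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n_rep, n, alpha = 5000, 30, 0.05  # illustrative choices

# Type I error rate: data are generated under the null (true mean 0),
# so every rejection of H0: mean = 0 is a false positive.
null_rej = sum(
    stats.ttest_1samp(rng.normal(0.0, 1.0, size=n), popmean=0.0).pvalue < alpha
    for _ in range(n_rep)
)

# Power: data are generated under a fixed alternative (true mean 0.5),
# so every rejection is a correct detection.
alt_rej = sum(
    stats.ttest_1samp(rng.normal(0.5, 1.0, size=n), popmean=0.0).pvalue < alpha
    for _ in range(n_rep)
)

print(f"estimated Type I error rate: {null_rej / n_rep:.3f}")  # close to alpha if calibrated
print(f"estimated power:             {alt_rej / n_rep:.3f}")
```
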
Methodologically, metatesting typically relies on simulation studies, where data are generated under controlled scenarios, or on cross-dataset benchmarking. A set of candidate tests is applied to each scenario, and performance is summarized with aggregate metrics such as average power, false discovery rate, familywise error rate, and calibration plots. Resampling, bootstrap, or permutation-based approaches may be used to obtain distribution-free estimates. Results are reported across scenario grids to reveal conditions under which each test performs well or poorly.

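To make this workflow concrete, the sketch below applies several candidate two-sample tests, including a permutation-based one, across a small scenario grid and reports their rejection rates, which estimate the Type I error rate on the null scenarios and power on the shifted ones. It is an illustration rather than a prescribed procedure: the scenarios, candidate tests, and replication counts are assumptions chosen for brevity.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
n_rep, n, alpha = 500, 40, 0.05  # kept small so the sketch runs quickly

# Illustrative scenario grid: each entry is a data-generating process for two
# groups. "null" scenarios share one distribution; "shift" scenarios differ in location.
scenarios = {
    "normal, null":        lambda: (rng.normal(0, 1, n), rng.normal(0, 1, n)),
    "normal, shift":       lambda: (rng.normal(0, 1, n), rng.normal(0.5, 1, n)),
    "heavy-tailed, null":  lambda: (rng.standard_t(3, n), rng.standard_t(3, n)),
    "heavy-tailed, shift": lambda: (rng.standard_t(3, n), rng.standard_t(3, n) + 0.5),
}

# Candidate tests: each maps two samples to a p-value.
candidates = {
    "Welch t-test": lambda x, y: stats.ttest_ind(x, y, equal_var=False).pvalue,
    "Mann-Whitney": lambda x, y: stats.mannwhitneyu(x, y).pvalue,
    "permutation":  lambda x, y: stats.permutation_test(
        (x, y), lambda a, b: np.mean(a) - np.mean(b),
        n_resamples=199, vectorized=False).pvalue,
}

# Rejection rate per (scenario, test): Type I error under null scenarios,
# power under shift scenarios.
for scen_name, generate in scenarios.items():
    for test_name, run_test in candidates.items():
        rejections = sum(run_test(*generate()) < alpha for _ in range(n_rep))
        print(f"{scen_name:20s} {test_name:12s} rejection rate = {rejections / n_rep:.3f}")
```

Tabulating rejection rates over such a grid shows, for example, whether a test's Type I error stays near the nominal level under heavy-tailed data and how much power the rank-based and permutation alternatives retain relative to the t-test.
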
Purposes include selecting appropriate testing procedures for given data-generating processes, understanding sensitivity to assumptions, and guiding reporting standards. Metatests can also help compare competing tests, highlight robustness to outliers or nonnormality, and inform recommendations in methodological guidelines.

Limitations include reliance on the realism and breadth of simulated or benchmark scenarios, potential bias in scenario design, and substantial computational cost. Because conclusions depend on the chosen conditions, metatesting complements but does not replace theoretical analyses or empirical validations.

See also: meta-analysis, simulation study, test validity, calibration.