klassenimbalance
Klassenimbalance, in supervised machine learning often simply called class imbalance, describes a situation where one class occurs much more frequently than others. In binary problems, the positive or minority class is underrepresented; in multi-class problems, several classes may be rare. Such distributions reflect many real-world tasks, including fraud detection, medical diagnosis, and anomaly screening.
The main consequence of imbalanced data is that models tend to favor the majority class. This can
Commonly used measurements include the class distribution ratio, recall, precision, F1-score, and the geometric mean of
Several strategies exist to address klassenimbalance. Data-level approaches include oversampling the minority class (random oversampling, SMOTE,