ConceptDrift
Concept drift refers to a change in the statistical properties of the target variable that a predictive model is trying to predict, over time. In practical terms, the relationship between inputs and outputs or the distribution of inputs themselves can shift, causing a model that was once accurate to perform poorly on new data. Concept drift is common in real-world data streams where the underlying data-generating process evolves.
Drift can be caused by changes in the world that alter how data are generated. Common types
- Sudden drift: an abrupt change in the data distribution or the predictive relationship.
- Gradual drift: a slow, steady shift over time.
- Incremental drift: a series of small changes accumulating to a noticeable difference.
- Recurring drift: previously observed concepts reappear after a period of absence.
Related distinctions include covariate shift (changes in the distribution of input features), prior probability shift (changes
Drift detection involves monitoring model performance or changes in data distributions to decide when to update
To cope with concept drift, practitioners use online learning, periodic retraining, or incremental model updates. Ensemble
Concept drift challenges model validity in non-stationary environments. Evaluation requires time-aware metrics and rolling assessments, ensuring