KNNImputation
KNNImputation, short for k-nearest neighbors imputation, is a method for filling in missing values in a dataset by leveraging the values of similar observations. It is non-parametric and relies on the idea that observations with similar patterns across other features are likely to share similar values for the missing feature.
The typical procedure: for each instance with a missing entry in a given feature, compute distances to
Considerations: It benefits from proper feature scaling, can handle numerical features, and tends to preserve local
Limitations: It may be inefficient on very large datasets; its assumptions may be violated if missingness depends