KNNImputation - Infinite Lexicon - Infinite Lexicon

KNNImputation

KNNImputation, short for k-nearest neighbors imputation, is a method for filling in missing values in a dataset by leveraging the values of similar observations. It is non-parametric and relies on the idea that observations with similar patterns across other features are likely to share similar values for the missing feature.

The typical procedure: for each instance with a missing entry in a given feature, compute distances to

Considerations: It benefits from proper feature scaling, can handle numerical features, and tends to preserve local

Limitations: It may be inefficient on very large datasets; its assumptions may be violated if missingness depends

k

a

implementations

standardization.

k

k

k