winsorizing
Winsorizing is a statistical data processing technique used to limit the influence of extreme values by replacing them with less extreme values near the tails of the distribution. It is named after Charles P. Winsor and originated in early 20th-century statistics. The method is a form of censoring rather than deleting observations; no data are removed, but extreme values are replaced.
Implementation: choose a tails proportion p (for example 5% or 1%). Compute the lower bound L as
Effects and use: Winsorizing reduces the influence of outliers on statistics such as the mean, variance, and
It is distinct from trimming, which discards outliers, whereas winsorizing caps them. The choice of p and