NadarayaWatsonin - Infinite Lexicon - Infinite Lexicon

NadarayaWatsonin

The Nadaraya–Watson estimator is a nonparametric method for estimating the conditional mean function m(x) = E[Y|X = x] in a regression setting. It is based on kernel smoothing and is used when the relationship between X and Y is complex or unknown.

Given a sample (Xi, Yi), i = 1,…,n, the Nadaraya–Watson estimator of m at a point x is

m̂(x) = [sum_{i=1}^n K_h(x − Xi) Yi] / [sum_{i=1}^n K_h(x − Xi)]

where K_h(u) = K(u/h)/h, h > 0 is the bandwidth and K is a nonnegative kernel function, typically

Key properties and practical aspects include:

- Bandwidth selection: h controls bias and variance; small h reduces bias but increases variance, while large

- Consistency: under mild regularity conditions, if h → 0 and nh → ∞ as n → ∞, m̂(x) converges to m(x)

- Bias and variance: bias depends on the curvature of m, often approximated by (h^2/2) m''(x) μ2(K); variance

- Boundary effects: estimation near the boundaries can be biased; boundary kernels or local linear variants can

Extensions include multivariate inputs with product kernels and connections to local constant and local linear regression.

a

a

a

h

cross-validation

a