PixelRNN

PixelRNN is an autoregressive generative model for images introduced in 2016 by Aaron van den Oord, Nal Kalchbrenner, and Koray Kavukcuoglu. It treats an image as a sequence of pixels and models the joint distribution P(X) as the product of conditional distributions P(xi | x1, x2, ..., x(i-1)) in raster-scan order. The approach aims to capture long-range dependencies in both spatial directions, enabling the generation of high-fidelity samples and providing a tractable density estimate for natural images.

The architecture relies on two-dimensional recurrent networks to propagate information across the image grid. It uses

Training and generation are performed by maximum likelihood estimation, using backpropagation through time to train the

Impact and scope: PixelRNN contributed to the development of autoregressive density estimators for images, influencing later

a

a

non-autoregressive

a