PixelRNN
PixelRNN is an autoregressive generative model for images introduced in 2016 by Aaron van den Oord, Nal Kalchbrenner, and Koray Kavukcuoglu. It treats an image as a sequence of pixels and models the joint distribution P(X) as the product of conditional distributions P(xi | x1, x2, ..., x(i-1)) in raster-scan order. The approach aims to capture long-range dependencies in both spatial directions, enabling the generation of high-fidelity samples and providing a tractable density estimate for natural images.
The architecture relies on two-dimensional recurrent networks to propagate information across the image grid. It uses
Training and generation are performed by maximum likelihood estimation, using backpropagation through time to train the
Impact and scope: PixelRNN contributed to the development of autoregressive density estimators for images, influencing later