Xpca

Xpca, short for eXtended Principal Component Analysis, is a family of dimensionality-reduction methods that generalize classical principal component analysis (PCA) to data that do not follow a Gaussian distribution. XPCA adopts a probabilistic latent-variable framework in which observed entries arise from a distribution in the exponential family conditioned on low-dimensional latent factors. The data matrix is modeled as the sum of a low-rank signal and noise, with the latent factors capturing the major structure.

Estimation typically proceeds via maximum likelihood or Bayesian inference, using algorithms such as expectation-maximization or alternating

XPCA is particularly suited to datasets where Gaussian assumptions are violated, such as survey responses with

Choosing the number of components is typically done by cross-validation or information criteria; the method is

XPCA relates to broader generalized PCA approaches and probabilistic PCA variants, and it complements matrix factorization

reconstructions

computationally

a