OPLS

OPLS stands for orthogonal projections to latent structures. It is a multivariate statistical method used to model the relationship between two data matrices, X (predictors) and Y (responses). It was introduced by Trygg and Wold in 2002 as an extension of partial least squares (PLS) regression. The key idea is to partition the variation in X into a predictive part that is correlated with Y and one or more orthogonal parts that are uncorrelated with Y. This separation isolates the signal of interest from unrelated variation, improving model interpretability and often predictive performance.

In OPLS, the model aims to maximize the covariance between the predictive X scores and Y, while

Applications are widespread in chemometrics, especially with metabolomics, spectroscopy, and other omics datasets, where high collinearity

Advantages include improved interpretability, clearer separation of predictive versus nonpredictive variation, and often better cross-validated performance.

X

a

Y

a

cross-validation