Multivariate techniques based on projection methods such as Principal Component Analysis and Partial Least Squares (PLS) regression are widely applied in metabolomics. However, the effects of confounding factors and the presence of specific clusters in the data could force the projection to produce inefficient representations in the latent space, preventing the identification of the most relevant data variation. To overcome this issue, we introduce a general framework for projection methods, allowing an easy integration of orthogonal constraints, which help in reducing the effect of uninformative variations. In particular, the discussed algorithms address different scenarios. When known confounding factors can be explicitly encoded into a proper constraint matrix, orthogonally Constrained Principal Component Analysis (oCPCA) and orthogonally Constrained PLS2 (oCPLS2) can be used. Orthogonal PLS (OPLS) and post‐transformation of PLS2 (ptPLS2), instead, are suited to problems in which a constraint matrix cannot be defined. Finally, a data integration task is considered: Orthogonal two‐block PLS (O2PLS) and Orthogonal Wold's two‐block Mode A PLS (OPLS‐W2A) are used to identify the common variation between two data sets

Stocchero, M.; Riccadonna, S.; Franceschi, P. (2018). Projection to latent structures with orthogonal constraints for metabolomics data. JOURNAL OF CHEMOMETRICS, 32 (5): e2987. doi: 10.1002/cem.2987 handle: http://hdl.handle.net/10449/42287

Projection to latent structures with orthogonal constraints for metabolomics data

Riccadonna, S.
Penultimo
;
Franceschi P.
Ultimo
2018-01-01

Abstract

Multivariate techniques based on projection methods such as Principal Component Analysis and Partial Least Squares (PLS) regression are widely applied in metabolomics. However, the effects of confounding factors and the presence of specific clusters in the data could force the projection to produce inefficient representations in the latent space, preventing the identification of the most relevant data variation. To overcome this issue, we introduce a general framework for projection methods, allowing an easy integration of orthogonal constraints, which help in reducing the effect of uninformative variations. In particular, the discussed algorithms address different scenarios. When known confounding factors can be explicitly encoded into a proper constraint matrix, orthogonally Constrained Principal Component Analysis (oCPCA) and orthogonally Constrained PLS2 (oCPLS2) can be used. Orthogonal PLS (OPLS) and post‐transformation of PLS2 (ptPLS2), instead, are suited to problems in which a constraint matrix cannot be defined. Finally, a data integration task is considered: Orthogonal two‐block PLS (O2PLS) and Orthogonal Wold's two‐block Mode A PLS (OPLS‐W2A) are used to identify the common variation between two data sets
Orthogonal PLS
Orthogonal signal correction
orthogonally constrained PLS
PLS regression
Settore CHIM/01 - CHIMICA ANALITICA
2018
Stocchero, M.; Riccadonna, S.; Franceschi, P. (2018). Projection to latent structures with orthogonal constraints for metabolomics data. JOURNAL OF CHEMOMETRICS, 32 (5): e2987. doi: 10.1002/cem.2987 handle: http://hdl.handle.net/10449/42287
File in questo prodotto:
File Dimensione Formato  
Stocchero_et_al-2018-Journal_of_Chemometrics.pdf

solo utenti autorizzati

Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 396.72 kB
Formato Adobe PDF
396.72 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10449/42287
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 15
  • ???jsp.display-item.citation.isi??? 16
social impact