Partial Least Square Regression (PLS-R) is a method of reducing the dimensionality of the data. PLS-R assumes that there is a common structure underlying the blocks of predictor (X) and response (Y) variables, and that this structure can be resumed by a few latent components, explaining the Y, calculated as a linear combination of the X. The components are obtained as to explain the maximum covariance between the Y and X variables. PLS-R is an iterative algorithm, and in any iteration Y-scores, X-weights, X-scores and Y-weights are sequentially calculated one as a function of the previous one. We applied PLS-R to an Italian case-control study of esophageal cancer in order to identify dietary patterns from 33 food groups and 6 nutrients as X and Y variables, respectively. We performed a logistic regression on PLS-R scores and found a strong direct association between a dietary pattern, characterized by a diet poor in fruit and vegetables, and rich in alcohol, red meat, bread, butter, unspecified seed oils, and the risk of esophageal cancer (odds ratio: 5.64 for the highest quintile compared with the lowest, confidence interval: 2.54-12.54).

Identification of dietary patterns using partial least square regression / V. Rosato, V. Edefonti, A. Decarli, M. Ferraroni. ((Intervento presentato al convegno Statistical Conference on Advances in Latent Variables : Methods, Models and Applications tenutosi a Brescia nel 2013.

Identification of dietary patterns using partial least square regression

V. Rosato
Primo
;
V. Edefonti
Secondo
;
A. Decarli
Penultimo
;
M. Ferraroni
Ultimo
2013

Abstract

Partial Least Square Regression (PLS-R) is a method of reducing the dimensionality of the data. PLS-R assumes that there is a common structure underlying the blocks of predictor (X) and response (Y) variables, and that this structure can be resumed by a few latent components, explaining the Y, calculated as a linear combination of the X. The components are obtained as to explain the maximum covariance between the Y and X variables. PLS-R is an iterative algorithm, and in any iteration Y-scores, X-weights, X-scores and Y-weights are sequentially calculated one as a function of the previous one. We applied PLS-R to an Italian case-control study of esophageal cancer in order to identify dietary patterns from 33 food groups and 6 nutrients as X and Y variables, respectively. We performed a logistic regression on PLS-R scores and found a strong direct association between a dietary pattern, characterized by a diet poor in fruit and vegetables, and rich in alcohol, red meat, bread, butter, unspecified seed oils, and the risk of esophageal cancer (odds ratio: 5.64 for the highest quintile compared with the lowest, confidence interval: 2.54-12.54).
Settore SECS-S/01 - Statistica
Società Italiana di Statistica (SIS)
Identification of dietary patterns using partial least square regression / V. Rosato, V. Edefonti, A. Decarli, M. Ferraroni. ((Intervento presentato al convegno Statistical Conference on Advances in Latent Variables : Methods, Models and Applications tenutosi a Brescia nel 2013.
Conference Object
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

Caricamento pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/253228
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact