In pooled analyses, where covariates may not be uniformly defined and coded across studies, and occasionally not measured in all of them, a joint model is often not feasible, and a two-stage method (see [1] for instance) is a simple, valid and practical method for the analysis, lending itself to flexibility with respect to differences in design, confounders and data collection across studies and to a better control for confounding. Simulations indicate that when the individual studies are large, two-stage methods produce nearly unbiased exposure estimates and standard errors of the exposure estimates from a generalized linear mixed model (GLMM). Based on these considerations, several existing cancer epidemiology consortia suggested in their protocols to apply two-stage methods instead of fitting GLMMs directly on the overall sample (see [2] for instance), although they still struggle to harmonize exposure variables of interest and potential confounders. However, it is unclear how well the two-stage method would perform if individual studies were smaller, especially when there are a few of them [3]. This may be a critical issue, especially as far as evidence has been accumulating on the major research questions a consortium was born for, and time is mature for more specific analyses on subgroups of studies. The International Head and Neck Cancer Epidemiology (INHANCE) consortium was established in 2004 to contribute elucidating the aetiology of head and neck cancer by providing opportunities for pooled analyses of individual-level data on a large scale. In the current version, the consortium included 35 case-control studies, with questionnaire data on over 26,000 cases and 34,000 controls [4]. Only 10 studies in the consortium provided detailed information on nutrient intakes. The available studies differ in terms of number of subjects included, geographical region, assessment of dietary information and food composition tables. This naturally provides a real-life situation of interest to understand how big is the difference, if any, in the effect estimates derived from the two-stage methods and the GLMM. In detail, we discuss this point in an application on vitamin C intake and head and neck cancer, after a preprocessing step improving comparability across studies. This research question may have important fallouts in the refinement of the statistical methods supporting the recent global effort towards pooling individual-level data in consortia.

Use of the two-stage methods in individual-level pooled data analyses : are they always the right solution? / V. Edefonti, M. Ferraroni. ((Intervento presentato al 27. convegno International Biometric Conference tenutosi a Firenze nel 2014.

Use of the two-stage methods in individual-level pooled data analyses : are they always the right solution?

V. Edefonti
Primo
;
M. Ferraroni
Ultimo
2014

Abstract

In pooled analyses, where covariates may not be uniformly defined and coded across studies, and occasionally not measured in all of them, a joint model is often not feasible, and a two-stage method (see [1] for instance) is a simple, valid and practical method for the analysis, lending itself to flexibility with respect to differences in design, confounders and data collection across studies and to a better control for confounding. Simulations indicate that when the individual studies are large, two-stage methods produce nearly unbiased exposure estimates and standard errors of the exposure estimates from a generalized linear mixed model (GLMM). Based on these considerations, several existing cancer epidemiology consortia suggested in their protocols to apply two-stage methods instead of fitting GLMMs directly on the overall sample (see [2] for instance), although they still struggle to harmonize exposure variables of interest and potential confounders. However, it is unclear how well the two-stage method would perform if individual studies were smaller, especially when there are a few of them [3]. This may be a critical issue, especially as far as evidence has been accumulating on the major research questions a consortium was born for, and time is mature for more specific analyses on subgroups of studies. The International Head and Neck Cancer Epidemiology (INHANCE) consortium was established in 2004 to contribute elucidating the aetiology of head and neck cancer by providing opportunities for pooled analyses of individual-level data on a large scale. In the current version, the consortium included 35 case-control studies, with questionnaire data on over 26,000 cases and 34,000 controls [4]. Only 10 studies in the consortium provided detailed information on nutrient intakes. The available studies differ in terms of number of subjects included, geographical region, assessment of dietary information and food composition tables. This naturally provides a real-life situation of interest to understand how big is the difference, if any, in the effect estimates derived from the two-stage methods and the GLMM. In detail, we discuss this point in an application on vitamin C intake and head and neck cancer, after a preprocessing step improving comparability across studies. This research question may have important fallouts in the refinement of the statistical methods supporting the recent global effort towards pooling individual-level data in consortia.
lug-2014
Settore MED/01 - Statistica Medica
International Biometric Society
Use of the two-stage methods in individual-level pooled data analyses : are they always the right solution? / V. Edefonti, M. Ferraroni. ((Intervento presentato al 27. convegno International Biometric Conference tenutosi a Firenze nel 2014.
Conference Object
File in questo prodotto:
File Dimensione Formato  
IBC-2014-Edefonti_inhance-2.pdf

accesso aperto

Descrizione: Abstract comunicazione orale
Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione 60.32 kB
Formato Adobe PDF
60.32 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/246012
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact