IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca

The term ‘combinatorial mixtures’ refers to a flexible class of models for inference on mixture distributions [4] whose components have multidimensional parameters. The idea behind it is to allow each element of component-specific parameter vectors to be shared by a subset of other components. We develop Bayesian inference and computational approaches for this class of distributions. We define a structure for a general prior distribution where a positive probability is put on every possible combination of sharing patterns, whence the name combinatorial mixtures. This partial sharing allows for generality and flexibility in comparison with traditional approaches to mixture modeling, while still allowing to assign significant mass to models that are more parsimonious than the general mixture case in which no sharing takes place. This also unifies the inference on component-specific parameters with that on the number of components. We illustrate our combinatorial mixtures in an application based on the normal model. We introduce normal mixture models for univariate and bivariate data, which are amenable to Markov Chain Monte Carlo computing. In the light of combinatorial mixtures, we assume a decomposition of the variance-covariance matrix proposed by Barnard et al. (2000) [1], which separates out standard deviations and correlations, and thus allows to model those parameters separately. This development was originally motivated by applications in molecular biology, where one deals with continuous measures, such as RNA levels, or protein levels, that vary across unknown biological subtypes. In some cases, subtypes are characterized by an increase in the level of the marker measured, while in others they are characterized by variability in otherwise tightly controlled processes, or by the presence of otherwise weak correlations. Also, several mechanisms can coexist. It may also allow to model an interesting phenomenon observed in microarray analysis when two variables have the same mean and variance but opposite correlations in diseased and normal samples [2]. We use data on molecular classification of lung cancer from the web-based information supporting the published manuscript Garber et al. (2001) [3].

Combinatorial mixtures of multiparameter distributions, with an application to microarray data / V. Edefonti, G. Parmigiani. ((Intervento presentato al 27. convegno International Biometric Conference tenutosi a Firenze nel 2014.

Combinatorial mixtures of multiparameter distributions, with an application to microarray data

V. Edefonti^Primo;G. Parmigiani

2014

Abstract

The term ‘combinatorial mixtures’ refers to a flexible class of models for inference on mixture distributions [4] whose components have multidimensional parameters. The idea behind it is to allow each element of component-specific parameter vectors to be shared by a subset of other components. We develop Bayesian inference and computational approaches for this class of distributions. We define a structure for a general prior distribution where a positive probability is put on every possible combination of sharing patterns, whence the name combinatorial mixtures. This partial sharing allows for generality and flexibility in comparison with traditional approaches to mixture modeling, while still allowing to assign significant mass to models that are more parsimonious than the general mixture case in which no sharing takes place. This also unifies the inference on component-specific parameters with that on the number of components. We illustrate our combinatorial mixtures in an application based on the normal model. We introduce normal mixture models for univariate and bivariate data, which are amenable to Markov Chain Monte Carlo computing. In the light of combinatorial mixtures, we assume a decomposition of the variance-covariance matrix proposed by Barnard et al. (2000) [1], which separates out standard deviations and correlations, and thus allows to model those parameters separately. This development was originally motivated by applications in molecular biology, where one deals with continuous measures, such as RNA levels, or protein levels, that vary across unknown biological subtypes. In some cases, subtypes are characterized by an increase in the level of the marker measured, while in others they are characterized by variability in otherwise tightly controlled processes, or by the presence of otherwise weak correlations. Also, several mechanisms can coexist. It may also allow to model an interesting phenomenon observed in microarray analysis when two variables have the same mean and variance but opposite correlations in diseased and normal samples [2]. We use data on molecular classification of lung cancer from the web-based information supporting the published manuscript Garber et al. (2001) [3].

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di presentazione
	
			lug-2014
		
	Settori scientifico-disciplinari dell'intervento
	
			Settore MED/01 - Statistica Medica
		
	Enti collegati al convegno
	
			International Biometric Society
		
	Citazione
	
			Combinatorial mixtures of multiparameter distributions, with an application to microarray data / V. Edefonti, G. Parmigiani. ((Intervento presentato al 27. convegno International Biometric Conference tenutosi a Firenze nel 2014.
		
	Tipologia
	
			Conference Object
		
	Appare nelle tipologie:
	
			14 - Intervento a convegno non pubblicato

File in questo prodotto:

File	Dimensione	Formato
IBC-2014-Edefonti-Parmigiani.pdf accesso aperto Descrizione: Abstract di poster Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore) Dimensione 64 kB Formato Adobe PDF Visualizza/Apri	64 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/246017

Citazioni

ND

ND

ND

social impact