IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca

Several solutions have been proposed to exploit the availability of heterogeneous sources of biomolecular data for gene function prediction, but few attention has been dedicated to the evaluation of the potential improvement in functional classification results that could be achieved through data fusion realized by means of ensemble-based techniques. In this contribution we test the performance of several ensembles of Support Vector Machine (SVM) classifiers, in which each component learner has been trained on different types of bio-molecular data, and then combined to obtain a consensus prediction using different aggregation techniques. Experimental results using data obtained with different high-throughput biotechnologies show that simple ensemble methods outperform both learning machines trained on single homogeneous types of bio-molecular data, and vector space integration methods.

Integration of heterogeneous data sources for gene function prediction using Decision Templates and ensembles of learning machines / M. Rè, G. Valentini. - In: NEUROCOMPUTING. - ISSN 0925-2312. - 73:7-9(2010), pp. 1533-1537. [10.1016/j.neucom.2009.12.012]

Integration of heterogeneous data sources for gene function prediction using Decision Templates and ensembles of learning machines

M. Rè^Primo;G. Valentini^Ultimo

2010

Abstract

Several solutions have been proposed to exploit the availability of heterogeneous sources of biomolecular data for gene function prediction, but few attention has been dedicated to the evaluation of the potential improvement in functional classification results that could be achieved through data fusion realized by means of ensemble-based techniques. In this contribution we test the performance of several ensembles of Support Vector Machine (SVM) classifiers, in which each component learner has been trained on different types of bio-molecular data, and then combined to obtain a consensus prediction using different aggregation techniques. Experimental results using data obtained with different high-throughput biotechnologies show that simple ensemble methods outperform both learning machines trained on single homogeneous types of bio-molecular data, and vector space integration methods.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
			Data integration; Decision fusion; Decision templates; Gene function prediction; Majority voting
		
	Settori scientifico-disciplinari dell'articolo
	
			Settore INF/01 - Informatica
		
	Data di pubblicazione
	
			2010
		
	Rivista in ANCE
	
			NEUROCOMPUTING
		
	DOI
	
			https://dx.doi.org/10.1016/j.neucom.2009.12.012
		
	Tipologia
	
			Article (author)
		
	Appare nelle tipologie:
	
			01 - Articolo su periodico

File in questo prodotto:

Non ci sono file associati a questo prodotto.

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/143295

Citazioni

ND

17

18

social impact