In this work we propose new ensemble methods for the hierarchical classification of gene functions. Our methods exploit the hierarchical relationships between the classes in different ways: each ensemble node is trained “locally”, according to its position in the hierarchy; moreover, in the evaluation phase the set of predicted annotations is built so to minimize a global loss function defined over the hierarchy. We also address the problem of sparsity of annotations by introducing a cost-sensitive parameter that allows to control the precision-recall trade-off. Experiments with the model organism S. cerevisiae, using the FunCat taxonomy and seven biomolecular data sets, reveal a significant advantage of our techniques over “flat” and cost-insensitive hierarchical ensembles.

Hierarchical cost-sensitive algorithms for genome-wide gene function prediction / N. Cesa-Bianchi, G. Valentini. - In: JOURNAL OF MACHINE LEARNING RESEARCH. - ISSN 1532-4435. - 8:(2010), pp. 14-29. ((Intervento presentato al 3. convegno International Workshop on Machine Learning in Systems Biology tenutosi a Ljubljana, Slovenia nel 2009.

Hierarchical cost-sensitive algorithms for genome-wide gene function prediction

N. Cesa-Bianchi
Primo
;
G. Valentini
Ultimo
2010

Abstract

In this work we propose new ensemble methods for the hierarchical classification of gene functions. Our methods exploit the hierarchical relationships between the classes in different ways: each ensemble node is trained “locally”, according to its position in the hierarchy; moreover, in the evaluation phase the set of predicted annotations is built so to minimize a global loss function defined over the hierarchy. We also address the problem of sparsity of annotations by introducing a cost-sensitive parameter that allows to control the precision-recall trade-off. Experiments with the model organism S. cerevisiae, using the FunCat taxonomy and seven biomolecular data sets, reveal a significant advantage of our techniques over “flat” and cost-insensitive hierarchical ensembles.
Hierarchical classification ; Gene function prediction ; Bayesian ensembles ; Cost-sensitive classification ; FunCat taxonomy
Settore INF/01 - Informatica
2010
http://jmlr.csail.mit.edu/proceedings/papers/v8/cesa-bianchi10a/cesa-bianchi10a.pdf
Article (author)
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/143301
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact