The genome-wide hierarchical classification of gene functions, using biomolecular data from high-throughput biotechnologies, is one of the central topics in bioinformatics and functional genomics. In this paper we present a multilabel hierarchical algorithm inspired by the “true path rule” that governs both the Gene Ontology and the Functional Catalogue (FunCat). In particular we propose an enhanced version of the True Path Rule (TPR) algorithm, by which we can control the flow of information between the classifiers of the hierarchical ensemble, thus allowing to tune the precision/recall characteristics of the overall hierarchical classification system. Results with the model organism S. cerevisiae show that the proposed method significantly improves on the basic version of the TPR algorithm, as well as on the Hierarchical Top-down and Flat ensembles.
Weighted True Path Rule: a multilabel hierarchical algorithm for gene function prediction / G. Valentini, M. Re. ((Intervento presentato al 1. convegno European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases : 1st International Workshop on learning from Multi-Label Data tenutosi a Bled, Slovenia nel 2009.
Weighted True Path Rule: a multilabel hierarchical algorithm for gene function prediction
G. ValentiniPrimo
;M. ReUltimo
2009
Abstract
The genome-wide hierarchical classification of gene functions, using biomolecular data from high-throughput biotechnologies, is one of the central topics in bioinformatics and functional genomics. In this paper we present a multilabel hierarchical algorithm inspired by the “true path rule” that governs both the Gene Ontology and the Functional Catalogue (FunCat). In particular we propose an enhanced version of the True Path Rule (TPR) algorithm, by which we can control the flow of information between the classifiers of the hierarchical ensemble, thus allowing to tune the precision/recall characteristics of the overall hierarchical classification system. Results with the model organism S. cerevisiae show that the proposed method significantly improves on the basic version of the TPR algorithm, as well as on the Hierarchical Top-down and Flat ensembles.| File | Dimensione | Formato | |
|---|---|---|---|
|
vale-re-ECML09.revised.pdf
accesso aperto
Tipologia:
Publisher's version/PDF
Dimensione
258.44 kB
Formato
Adobe PDF
|
258.44 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




