The multi-label hierarchical prediction of gene functions at genome and ontology-wide level is a central problem in bioinformatics, and raises challenging questions from a machine learning standpoint. In this context, multi-label hierarchical ensemble methods that take into account the hierarchical relationships between functional classes have been recently proposed. Various studies also showed that the integration of multiple sources of data is one of the key issues to significantly improve gene function prediction. We propose an integrated approach that combines local data fusion strategies with global hierarchical multi-label methods. The label unbalance typically occurring in gene functional classes is taken into account through the use of cost-sensitive techniques. Ontology-wide results with the yeast model organism, using the FunCat taxonomy, show the effectiveness of the proposed methodological approach.
Functional inference in FunCat through the combination of hierarchical ensembles with data fusion methods / N. Cesa-Bianchi, M. Re, G. Valentini. ((Intervento presentato al 2. convegno ICML Workshop on learning from Multi-Label Data MLD'10 tenutosi a Haifa, Israel nel 2010.
Functional inference in FunCat through the combination of hierarchical ensembles with data fusion methods
N. Cesa-BianchiPrimo
;M. ReSecondo
;G. ValentiniUltimo
2010
Abstract
The multi-label hierarchical prediction of gene functions at genome and ontology-wide level is a central problem in bioinformatics, and raises challenging questions from a machine learning standpoint. In this context, multi-label hierarchical ensemble methods that take into account the hierarchical relationships between functional classes have been recently proposed. Various studies also showed that the integration of multiple sources of data is one of the key issues to significantly improve gene function prediction. We propose an integrated approach that combines local data fusion strategies with global hierarchical multi-label methods. The label unbalance typically occurring in gene functional classes is taken into account through the use of cost-sensitive techniques. Ontology-wide results with the yeast model organism, using the FunCat taxonomy, show the effectiveness of the proposed methodological approach.File | Dimensione | Formato | |
---|---|---|---|
cesa-re-vale.ICML.MLD10.rev.pdf
accesso aperto
Tipologia:
Publisher's version/PDF
Dimensione
202 kB
Formato
Adobe PDF
|
202 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.