The Human Phenotype Ontology (HPO) provides a standard categorization of the phenotypic abnormalities encountered in human diseases and of the semantic relationship between them. Quite surprisingly the problem of the automated prediction of the association between genes and abnormal human phenotypes has been widely overlooked, even if this issue represents an important step toward the characterization of gene-disease associations, especially when no or very limited knowledge is available about the genetic etiology of the disease under study. We present a novel ensemble method able to capture the hierarchical relationships between HPO terms, and able to improve existing hierarchical ensemble algorithms by explicitly considering the predictions of the descendantterms of the ontology. In this way the algorithm exploits the information embedded in the most specific ontology terms that closely characterize the phenotypic information associated with each human gene. Genome-wide results obtained by integrating multiple sources of information show the effectiveness of the proposed approach.

Ensembling Descendant Term Classifiers to Improve Gene : Abnormal Phenotype Predictions / M. Notaro, M. Schubach, M. Frasca, M. Mesiti, P.N. Robinson, G. Valentini (LECTURE NOTES IN BIOINFORMATICS). - In: Computational Intelligence methods for Bioinformatics and Biostatistics / [a cura di] M. Bartoletti, A. Barla, A. Bracciali, G.W. Klau, L. Peterson, A. Policriti, R. Tagliaferri. - Prima edizione. - Switzerland : Springer Nature, 2019. - ISBN 9783030141592. - pp. 70-80 (( Intervento presentato al 14. convegno Computational Intelligence methods for Bioniformatics and Biostatistics tenutosi a Cagliari nel 2017 [10.1007/978-3-030-14160-8_8].

Ensembling Descendant Term Classifiers to Improve Gene : Abnormal Phenotype Predictions

M. Notaro
Primo
;
M. Frasca;M. Mesiti;G. Valentini
Ultimo
2019

Abstract

The Human Phenotype Ontology (HPO) provides a standard categorization of the phenotypic abnormalities encountered in human diseases and of the semantic relationship between them. Quite surprisingly the problem of the automated prediction of the association between genes and abnormal human phenotypes has been widely overlooked, even if this issue represents an important step toward the characterization of gene-disease associations, especially when no or very limited knowledge is available about the genetic etiology of the disease under study. We present a novel ensemble method able to capture the hierarchical relationships between HPO terms, and able to improve existing hierarchical ensemble algorithms by explicitly considering the predictions of the descendantterms of the ontology. In this way the algorithm exploits the information embedded in the most specific ontology terms that closely characterize the phenotypic information associated with each human gene. Genome-wide results obtained by integrating multiple sources of information show the effectiveness of the proposed approach.
Hierarchical multi-label classification; Hierarchical ensemble methods; Gene-abnormal phenotype prediction
Settore INF/01 - Informatica
Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni
2019
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
DESCENS_LNBI_final.pdf

accesso riservato

Tipologia: Pre-print (manoscritto inviato all'editore)
Dimensione 506.91 kB
Formato Adobe PDF
506.91 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Notaro2019_Chapter_EnsemblingDescendantTermClassi.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 813.63 kB
Formato Adobe PDF
813.63 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/619269
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact