One of the main issues in detecting the genes involved in the etiology of genetic human diseases is the integration of different types of available functional relationships between genes. Numerous approaches exploited the complementary evidence coded in heterogeneous sources of data to prioritize disease-genes, such as functional profiles or expression quantitative trait loci, but none of them to our knowledge posed the scarcity of known disease-genes as a feature of their integration methodology. Nevertheless, in contexts where data are unbalanced, that is, where one class is largely under-represented, imbalance-unaware approaches may suffer a strong decrease in performance. We claim that imbalance-aware integration is a key requirement for boosting performance of gene prioritization (GP) methods. To support our claim, we propose an imbalance-aware integration algorithm for the GP problem, and we compare it on benchmark data with other state-of-the-art integration methodologies.

Disease–Genes Must Guide Data Source Integration in the Gene Prioritization Process / M. Frasca, J.F. Fontaine, G. Valentini, M. Mesiti, M. Notaro, D. Malchiodi, M.A. Andrade-Navarro (LECTURE NOTES IN COMPUTER SCIENCE). - In: Computational Intelligence Methods for Bioinformatics and Biostatistics / [a cura di] M. Bartoletti, A. Barla, A. Bracciali, G. W. Klau, L. Peterson, A. Policriti, R. Tagliaferri. - Prima edizione. - Cham : Springer, 2019. - ISBN 9783030125974. - pp. 60-69 (( Intervento presentato al 14. convegno CIBB tenutosi a Cagliari nel 2017 [10.1007/978-3-030-14160-8_7].

Disease–Genes Must Guide Data Source Integration in the Gene Prioritization Process

M. Frasca
Primo
;
G. Valentini;M. Mesiti;M. Notaro;D. Malchiodi
Penultimo
;
2019

Abstract

One of the main issues in detecting the genes involved in the etiology of genetic human diseases is the integration of different types of available functional relationships between genes. Numerous approaches exploited the complementary evidence coded in heterogeneous sources of data to prioritize disease-genes, such as functional profiles or expression quantitative trait loci, but none of them to our knowledge posed the scarcity of known disease-genes as a feature of their integration methodology. Nevertheless, in contexts where data are unbalanced, that is, where one class is largely under-represented, imbalance-unaware approaches may suffer a strong decrease in performance. We claim that imbalance-aware integration is a key requirement for boosting performance of gene prioritization (GP) methods. To support our claim, we propose an imbalance-aware integration algorithm for the GP problem, and we compare it on benchmark data with other state-of-the-art integration methodologies.
Gene prioritization; Imbalance-aware integration; Medical Subject Headings; Network integration
Settore INF/01 - Informatica
2019
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
LNBI_cibb17_camera_ready.pdf

accesso riservato

Tipologia: Pre-print (manoscritto inviato all'editore)
Dimensione 344.64 kB
Formato Adobe PDF
344.64 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Frasca2019_Chapter_DiseaseGenesMustGuideDataSourc.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 362.03 kB
Formato Adobe PDF
362.03 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/629519
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact