One of the main issues in detecting the genes involved in the etiology of genetic human diseases is the integration of different types of available functional relationships between genes. Numerous approaches exploited the complementary evidence coded in heterogeneous sources of data to prioritize disease-genes, such as functional profiles or expression quantitative trait loci, but none of them to our knowledge posed the scarcity of known disease-genes as a feature of their integration methodology. Nevertheless, in contexts where data are unbalanced, that is, where one class is largely under-represented, imbalance-unaware approaches may suffer a strong decrease in performance. We claim that imbalance-aware integration is a key requirement for boosting performance of gene prioritization (GP) methods. To support our claim, we propose an imbalance-aware integration algorithm for the GP problem, and we compare it on benchmark data with other state-of-the-art integration methodologies.
Disease–Genes Must Guide Data Source Integration in the Gene Prioritization Process / M. Frasca, J.F. Fontaine, G. Valentini, M. Mesiti, M. Notaro, D. Malchiodi, M.A. Andrade-Navarro (LECTURE NOTES IN COMPUTER SCIENCE). - In: Computational Intelligence Methods for Bioinformatics and Biostatistics / [a cura di] M. Bartoletti, A. Barla, A. Bracciali, G. W. Klau, L. Peterson, A. Policriti, R. Tagliaferri. - Prima edizione. - Cham : Springer, 2019. - ISBN 9783030125974. - pp. 60-69 (( Intervento presentato al 14. convegno CIBB tenutosi a Cagliari nel 2017 [10.1007/978-3-030-14160-8_7].
Disease–Genes Must Guide Data Source Integration in the Gene Prioritization Process
M. Frasca
Primo
;G. Valentini;M. Mesiti;M. Notaro;D. MalchiodiPenultimo
;
2019
Abstract
One of the main issues in detecting the genes involved in the etiology of genetic human diseases is the integration of different types of available functional relationships between genes. Numerous approaches exploited the complementary evidence coded in heterogeneous sources of data to prioritize disease-genes, such as functional profiles or expression quantitative trait loci, but none of them to our knowledge posed the scarcity of known disease-genes as a feature of their integration methodology. Nevertheless, in contexts where data are unbalanced, that is, where one class is largely under-represented, imbalance-unaware approaches may suffer a strong decrease in performance. We claim that imbalance-aware integration is a key requirement for boosting performance of gene prioritization (GP) methods. To support our claim, we propose an imbalance-aware integration algorithm for the GP problem, and we compare it on benchmark data with other state-of-the-art integration methodologies.File | Dimensione | Formato | |
---|---|---|---|
LNBI_cibb17_camera_ready.pdf
accesso riservato
Tipologia:
Pre-print (manoscritto inviato all'editore)
Dimensione
344.64 kB
Formato
Adobe PDF
|
344.64 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Frasca2019_Chapter_DiseaseGenesMustGuideDataSourc.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
362.03 kB
Formato
Adobe PDF
|
362.03 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.