Finding genes associated with human genetic disorders is one of the most challenging problems in bio-medicine. In this context, to guide researchers in detecting the most reliable candidate causative-genes for the disease of interest, gene prioritization methods represent a necessary support to automatically rank genes according to their involvement in the disease under study. This problem is characterized by highly unbalanced classes (few causative and much more non-causative genes) and requires the adoption of cost-sensitive techniques to achieve reliable solutions. In this work we propose a network-based methodology for disease-gene prioritization designed to expressly cope with the data imbalance. Its validation over a benchmark composed of 708 selected medical subject headings (MeSH) diseases, shows that our approach is competitive with state-of-art methodologies, and its reduced time complexity makes its application feasible on large-size datasets.
Gene-disease prioritization through cost-sensitive graph-based methodologies / M. Frasca, S. Bassis (LECTURE NOTES IN COMPUTER SCIENCE). - In: Bioinformatics and biomedical engineering / [a cura di] F. Ortuño, I. Rojas. - Prima edizione. - [s.l] : Springer International, 2016. - ISBN 9783319317434. - pp. 739-751 (( Intervento presentato al 4. convegno International Work-Conference on Bioinformatics and Biomedical Engineering tenutosi a Granada nel 2016 [10.1007/978-3-319-31744-1_64].
Gene-disease prioritization through cost-sensitive graph-based methodologies
M. FrascaPrimo
;S. BassisUltimo
2016
Abstract
Finding genes associated with human genetic disorders is one of the most challenging problems in bio-medicine. In this context, to guide researchers in detecting the most reliable candidate causative-genes for the disease of interest, gene prioritization methods represent a necessary support to automatically rank genes according to their involvement in the disease under study. This problem is characterized by highly unbalanced classes (few causative and much more non-causative genes) and requires the adoption of cost-sensitive techniques to achieve reliable solutions. In this work we propose a network-based methodology for disease-gene prioritization designed to expressly cope with the data imbalance. Its validation over a benchmark composed of 708 selected medical subject headings (MeSH) diseases, shows that our approach is competitive with state-of-art methodologies, and its reduced time complexity makes its application feasible on large-size datasets.File | Dimensione | Formato | |
---|---|---|---|
CS_gp_RED_v2.pdf
accesso riservato
Descrizione: Articolo principale
Tipologia:
Pre-print (manoscritto inviato all'editore)
Dimensione
408.51 kB
Formato
Adobe PDF
|
408.51 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
chp%3A10.1007%2F978-3-319-31744-1_64.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
893.99 kB
Formato
Adobe PDF
|
893.99 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.