Finding genes associated with human genetic disorders is one of the most challenging problems in bio-medicine. In this context, to guide researchers in detecting the most reliable candidate causative-genes for the disease of interest, gene prioritization methods represent a necessary support to automatically rank genes according to their involvement in the disease under study. This problem is characterized by highly unbalanced classes (few causative and much more non-causative genes) and requires the adoption of cost-sensitive techniques to achieve reliable solutions. In this work we propose a network-based methodology for disease-gene prioritization designed to expressly cope with the data imbalance. Its validation over a benchmark composed of 708 selected medical subject headings (MeSH) diseases, shows that our approach is competitive with state-of-art methodologies, and its reduced time complexity makes its application feasible on large-size datasets.

Gene-disease prioritization through cost-sensitive graph-based methodologies / M. Frasca, S. Bassis (LECTURE NOTES IN COMPUTER SCIENCE). - In: Bioinformatics and biomedical engineering / [a cura di] F. Ortuño, I. Rojas. - Prima edizione. - [s.l] : Springer International, 2016. - ISBN 9783319317434. - pp. 739-751 (( Intervento presentato al 4. convegno International Work-Conference on Bioinformatics and Biomedical Engineering tenutosi a Granada nel 2016 [10.1007/978-3-319-31744-1_64].

Gene-disease prioritization through cost-sensitive graph-based methodologies

M. Frasca
Primo
;
S. Bassis
Ultimo
2016

Abstract

Finding genes associated with human genetic disorders is one of the most challenging problems in bio-medicine. In this context, to guide researchers in detecting the most reliable candidate causative-genes for the disease of interest, gene prioritization methods represent a necessary support to automatically rank genes according to their involvement in the disease under study. This problem is characterized by highly unbalanced classes (few causative and much more non-causative genes) and requires the adoption of cost-sensitive techniques to achieve reliable solutions. In this work we propose a network-based methodology for disease-gene prioritization designed to expressly cope with the data imbalance. Its validation over a benchmark composed of 708 selected medical subject headings (MeSH) diseases, shows that our approach is competitive with state-of-art methodologies, and its reduced time complexity makes its application feasible on large-size datasets.
Gene-disease prioritization; graph-based node ranking; cost-sensitive learning
Settore INF/01 - Informatica
2016
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
CS_gp_RED_v2.pdf

accesso riservato

Descrizione: Articolo principale
Tipologia: Pre-print (manoscritto inviato all'editore)
Dimensione 408.51 kB
Formato Adobe PDF
408.51 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
chp%3A10.1007%2F978-3-319-31744-1_64.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 893.99 kB
Formato Adobe PDF
893.99 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/388362
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 2
social impact