Several problems in computational biology and medicine are modeled as learning problems in graphs, where nodes represent the biological entities to be studied, e.g. proteins, and connections different kinds of relationships among them, e.g. protein-protein interactions. In this context, classes are usually characterized by a high imbalance, i.e. positive examples for a class are much less than those negative. Although several works studied this problem, no graph-based software designed to explicitly take into account the label imbalance in biological networks is available. We propose COSNet, an R package to serve this purpose. COSNet deals with the label imbalance problem by implementing a novel parametric model of Hopfield Network (HN), whose output levels and activation thresholds of neurons are parameters to be automatically learnt. Due to the quasi-linear time complexity, COSNet nicely scales when the number of instances is large, and application examples to challenging problems in biomedicine show the efficiency and the accuracy of the proposed library.

COSNet: An R package for label prediction in unbalanced biological networks / M. Frasca, G. Valentini. - In: NEUROCOMPUTING. - ISSN 0925-2312. - 237(2017 May 10), pp. 397-400. [10.1016/j.neucom.2015.11.096]

COSNet: An R package for label prediction in unbalanced biological networks

M. Frasca
;
G. Valentini
Ultimo
2017

Abstract

Several problems in computational biology and medicine are modeled as learning problems in graphs, where nodes represent the biological entities to be studied, e.g. proteins, and connections different kinds of relationships among them, e.g. protein-protein interactions. In this context, classes are usually characterized by a high imbalance, i.e. positive examples for a class are much less than those negative. Although several works studied this problem, no graph-based software designed to explicitly take into account the label imbalance in biological networks is available. We propose COSNet, an R package to serve this purpose. COSNet deals with the label imbalance problem by implementing a novel parametric model of Hopfield Network (HN), whose output levels and activation thresholds of neurons are parameters to be automatically learnt. Due to the quasi-linear time complexity, COSNet nicely scales when the number of instances is large, and application examples to challenging problems in biomedicine show the efficiency and the accuracy of the proposed library.
biological network; label imbalance; node label prediction; protein function prediction; r package; artificial intelligence; computer science applications, computer vision and pattern recognition; cognitive neuroscience
Settore INF/01 - Informatica
10-mag-2017
29-apr-2016
Article (author)
File in questo prodotto:
File Dimensione Formato  
Pubb2.pdf

accesso aperto

Tipologia: Pre-print (manoscritto inviato all'editore)
Dimensione 482.13 kB
Formato Adobe PDF
482.13 kB Adobe PDF Visualizza/Apri
1-s2.0-S0925231216000485-main.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 337.14 kB
Formato Adobe PDF
337.14 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/422136
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 8
  • ???jsp.display-item.citation.isi??? 6
social impact