Given a weighted graph and a partial node labeling, the graph classification problem consists in predicting the labels of all the nodes. In several application domains, from gene to social network analysis, the labeling is unbalanced: for instance positive labels may be much less than negatives. In this paper we present COSNet (COst Sensitive neural Network), a neural algorithm for predicting node labels in graphs with unbalanced labels. COSNet is based on a 2-parameter family of Hopfield networks, and consists of two main steps: (1) the network parameters are learned through a cost-sensitive optimization procedure; (2) a suitable Hopfield network restricted to the unlabeled nodes is considered and simulated. The reached equilibrium point induces the classification of the unlabeled nodes. The restriction of the dynamics leads to a significant reduction in time complexity and allows the algorithm to nicely scale with large networks. An experimental analysis on real-world unbalanced data, in the context of the genome-wide prediction of gene functions, shows the effectiveness of the proposed approach.

A neural network algorithm for semi-supervised node label learning from unbalanced data / M.Frasca, A. Bertoni, M. Re, G. Valentini. - In: NEURAL NETWORKS. - ISSN 0893-6080. - 43(2013), pp. 84-98. [10.1016/j.neunet.2013.01.021]

A neural network algorithm for semi-supervised node label learning from unbalanced data

M. Frasca
Primo
;
A. Bertoni
Secondo
;
M. Re
Penultimo
;
G. Valentini
Ultimo
2013

Abstract

Given a weighted graph and a partial node labeling, the graph classification problem consists in predicting the labels of all the nodes. In several application domains, from gene to social network analysis, the labeling is unbalanced: for instance positive labels may be much less than negatives. In this paper we present COSNet (COst Sensitive neural Network), a neural algorithm for predicting node labels in graphs with unbalanced labels. COSNet is based on a 2-parameter family of Hopfield networks, and consists of two main steps: (1) the network parameters are learned through a cost-sensitive optimization procedure; (2) a suitable Hopfield network restricted to the unlabeled nodes is considered and simulated. The reached equilibrium point induces the classification of the unlabeled nodes. The restriction of the dynamics leads to a significant reduction in time complexity and allows the algorithm to nicely scale with large networks. An experimental analysis on real-world unbalanced data, in the context of the genome-wide prediction of gene functions, shows the effectiveness of the proposed approach.
Hopfield neural networks; Learning from unbalanced data; Node label prediction; Semi-supervised learning in graphs
Settore INF/01 - Informatica
   Pattern Analysis, Statistical Modelling and Computational Learning 2
   PASCAL2
   EUROPEAN COMMISSION
   FP7
   216886
2013
Article (author)
File in questo prodotto:
File Dimensione Formato  
COSNet.2nd_rev.pdf

accesso aperto

Tipologia: Pre-print (manoscritto inviato all'editore)
Dimensione 356.68 kB
Formato Adobe PDF
356.68 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/219468
Citazioni
  • ???jsp.display-item.citation.pmc??? 10
  • Scopus 44
  • ???jsp.display-item.citation.isi??? 38
social impact