The semi-supervised problem of learning node labels in graphs consists, given a partial graph labeling, in inferring the unknown labels of the unlabeled vertices. Several machine learning algorithms have been proposed for solving this problem, including Hopfield networks and label propagation methods; however, some issues have been only partially considered, e.g. the preservation of the prior knowledge and the unbalance between positive and negative labels. To address these items, we propose a Hopfield-based cost sensitive neural network algorithm (COSNet). The method factorizes the solution of the problem in two parts: 1) the sub- network composed by the labelled vertices is considered, and the net- work parameters are estimated through a supervised algorithm; 2) the estimated parameters are extended to the subnetwork composed of the unlabeled vertices, and the attractor reached by the dynamics of this subnetwork allows to predict the labeling of the unlabeled vertices. The proposed method embeds in the neural algorithm the ”a priori” knowl- edge coded in the labelled part of the graph, and separates node labels and neuron states, allowing to differentially weight positive and nega- tive node labels. Moreover, COSNet introduces an efficient cost-sensitive strategy which allows to learn the near-optimal parameters of the net- work in order to take into account the unbalance between positive and negative node labels. Finally, the dynamics of the network is restricted to its unlabeled part, preserving the minimization of the overall objective function and significantly reducing the time complexity of the learning algorithm. COSNet has been applied to the genome-wide prediction of gene function in a model organism. The results, compared with those ob- tained by other semi-supervised label propagation algorithms and super- vised machine learning methods, show the effectiveness of the proposed approach.

COSNet : a cost sensitive neural network for semi-supervised learning in graphs / A. Bertoni, M. Frasca, G. Valentini - In: Machine learning and knowledge discovery in databases : European conference, ECML PKDD 2010 : Athens, Greece, september 5-9, 2011 : proceedings. Part 1 / [a cura di] D. Gunopulos, T. Hofmann, D. Malerba, M. Vazirgiannis. - Berlin : Springer, 2011. - ISBN 9783642237799. - pp. 219-234 (( Intervento presentato al 21. convegno European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) tenutosi a Athens nel 2011 [10.1007/978-3-642-23780-5_24].

COSNet : a cost sensitive neural network for semi-supervised learning in graphs

A. Bertoni
Primo
;
M. Frasca
Secondo
;
G. Valentini
Ultimo
2011

Abstract

The semi-supervised problem of learning node labels in graphs consists, given a partial graph labeling, in inferring the unknown labels of the unlabeled vertices. Several machine learning algorithms have been proposed for solving this problem, including Hopfield networks and label propagation methods; however, some issues have been only partially considered, e.g. the preservation of the prior knowledge and the unbalance between positive and negative labels. To address these items, we propose a Hopfield-based cost sensitive neural network algorithm (COSNet). The method factorizes the solution of the problem in two parts: 1) the sub- network composed by the labelled vertices is considered, and the net- work parameters are estimated through a supervised algorithm; 2) the estimated parameters are extended to the subnetwork composed of the unlabeled vertices, and the attractor reached by the dynamics of this subnetwork allows to predict the labeling of the unlabeled vertices. The proposed method embeds in the neural algorithm the ”a priori” knowl- edge coded in the labelled part of the graph, and separates node labels and neuron states, allowing to differentially weight positive and nega- tive node labels. Moreover, COSNet introduces an efficient cost-sensitive strategy which allows to learn the near-optimal parameters of the net- work in order to take into account the unbalance between positive and negative node labels. Finally, the dynamics of the network is restricted to its unlabeled part, preserving the minimization of the overall objective function and significantly reducing the time complexity of the learning algorithm. COSNet has been applied to the genome-wide prediction of gene function in a model organism. The results, compared with those ob- tained by other semi-supervised label propagation algorithms and super- vised machine learning methods, show the effectiveness of the proposed approach.
Settore INF/01 - Informatica
   Pattern Analysis, Statistical Modelling and Computational Learning 2
   PASCAL2
   EUROPEAN COMMISSION
   FP7
   216886
2011
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
Cosnet.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 271.43 kB
Formato Adobe PDF
271.43 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/177874
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 27
  • ???jsp.display-item.citation.isi??? 20
social impact