Relevant problems in molecular biology and medicine can be modeled as graphs in which nodes represent bio-molecular or chemical entities (e.g., genes or drugs) and edges represent some notion of similarity between them. In this context, semi-supervised learning methods that exploit both the local characteristics of the network (e.g., the neighborhood of a node) and its global characteristics (e.g., its overall topology) have been applied to extract meaningful biological and medical knowledge from biological systems. In this work we summarize the main characteristics of RANKS (RAnking Nodes through Kernelized Score functions), a recently proposed semi-supervised algorithmic scheme based on local score functions that embed well-designed graph kernels and can therefore account for both the local and the global features of the analyzed network. We show some successful applications of RANKS to protein function prediction, gene-disease association, and drug repositioning problems. Moreover, we present a novel secondary memory-based, "vertex-centric" version of the algorithm that scales well to graphs with hundreds of thousands of nodes and tens of millions of edges on off-the-shelf desktop computers, and we show an application to a complex multi-species protein function prediction problem.
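To make the idea of a kernelized score function concrete, the following is a minimal sketch and not the authors' implementation. It assumes one common choice of graph kernel, the one-step random walk kernel K = (a - 1)I + D^{-1/2} W D^{-1/2}, and an average score that rates each node by its mean kernel similarity to the known positive nodes. The function names, the parameter `a`, and the toy graph are all hypothetical and chosen only for illustration.

```python
import math


def random_walk_kernel(W, a=2.0):
    """Assumed kernel: K = (a - 1) * I + D^{-1/2} W D^{-1/2}.

    W is a symmetric, non-negative adjacency matrix (list of lists).
    """
    n = len(W)
    deg = [sum(row) for row in W]
    K = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            # Skip isolated nodes to avoid dividing by zero.
            if deg[i] > 0 and deg[j] > 0:
                K[i][j] = W[i][j] / math.sqrt(deg[i] * deg[j])
        K[i][i] += a - 1.0
    return K


def average_score(K, positives):
    """Score every node by its mean kernel similarity to the positive nodes."""
    return [
        sum(K[v][x] for x in positives) / len(positives)
        for v in range(len(K))
    ]


def rank_nodes(scores, exclude=()):
    """Return node indices sorted by decreasing score, skipping `exclude`."""
    candidates = [v for v in range(len(scores)) if v not in exclude]
    return sorted(candidates, key=lambda v: scores[v], reverse=True)


# Hypothetical toy network: a triangle 0-1-2 with a tail 2-3-4.
W = [
    [0, 1, 1, 0, 0],
    [1, 0, 1, 0, 0],
    [1, 1, 0, 1, 0],
    [0, 0, 1, 0, 1],
    [0, 0, 0, 1, 0],
]
positives = [0, 1]  # nodes already known to belong to the class of interest

K = random_walk_kernel(W)
scores = average_score(K, positives)
ranking = rank_nodes(scores, exclude=positives)
print(ranking)
```

In this sketch the score is local, since each node is compared only with the labeled nodes, while the kernel carries the network structure. A one-step kernel sees only direct neighbors; under the same assumptions, taking powers of K would let the score reflect longer paths and hence more of the global topology.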
Analysis of bio-molecular networks through semi-supervised graph-based learning methods / M. Re, M. Mesiti, M. Frasca, J. Lin, G. Valentini. (Paper presented at the 13th Italian Workshop on Machine Learning and Data Mining - AI*IA Symposium on Artificial Intelligence, held in Pisa in 2014.)
|Title:||Analysis of bio-molecular networks through semi-supervised graph-based learning methods|
RE', MATTEO (First)
MESITI, MARCO (Second)
LIN, JIANYI (Penultimate)
VALENTINI, GIORGIO (Last)
|Publication date:||Dec-2014|
|Keywords:||graph based learning; biomolecular network analysis; bioinformatics; big data analysis|
|Scientific Disciplinary Sector:||Sector INF/01 - Computer Science|
Sector ING-INF/05 - Information Processing Systems
|Appears in types:||14 - Unpublished conference contribution|