As the number of sequenced genomes rapidly grows, Automated Prediction of gene Function (AFP) is now a challenging problem. Despite significant progresses in the last several years, the accuracy of gene function prediction still needs to be improved in order to be used effectively in practice. Two of the main issues of AFP problem are the imbalance of gene functional annotations and the 'multifunctional properties' of genes. While the former is a well studied problem in machine learning, the latter has recently emerged in bioinformatics and few studies have been carried out about it. Here we propose a method for AFP which appropriately handles the label imbalance characterizing biological taxonomies, and embeds in the model the property of some genes of being 'multifunctional'. We tested the method in predicting the functions of the Gene Ontology functional hierarchy for genes of yeast and fly model organisms, in a genome-wide approach. The achieved results show that cost-sensitive strategies and 'gene multifunctionality' can be combined to achieve significantly better results than the compared state-of-the-art algorithms for AFP.
|Titolo:||Automated gene function prediction through gene multifunctionality in biological networks|
FRASCA, MARCO (Corresponding)
|Parole Chiave:||Biological networks; Cost-sensitive learning; Gene function prediction; Gene multifunctionality; Gene ranking; Hopfield networks|
|Settore Scientifico Disciplinare:||Settore INF/01 - Informatica|
|Data di pubblicazione:||25-ago-2015|
|Digital Object Identifier (DOI):||10.1016/j.neucom.2015.04.007|
|Appare nelle tipologie:||01 - Articolo su periodico|