Membrane transport systems comprise roughly 10% of all proteins in a cell and play a critical role in many biological processes. Improving and expanding their classification is an important goal that can affect studies involving comparative and functional genomics, probing molecular mechanisms of diseases and metabolic processes, and searching new therapeutic targets and pharmacologically relevant transport proteins. In this context, a relevant classification problem is represented by the characterization of transport proteins according to the TC (Transporter Classification) data base (TCDB). In this work we introduce an integrative machine learning based approach that tries to consider all the above issues. To this end we propose a novel structured-output method able to explicitly consider the hierarchical relationships between TCDB categories. The proposed classifier exploits state-of-the-art Multiple Kernel Learning (MKL) strategies to integrate a very large set of features extracted from up-to-date databases and it is conceived to be applied virtually to any organism for the TCDB-wide and proteome-wide prediction of the categories of transporters.
Transport protein classification through structured prediction and multiple lernel learning / H. Su, G. Valentini, S. Szedmak, J. Rousu. ((Intervento presentato al convegno NIPS Workshop on Machine Learning in Computational Biology (MLCB) & Machine Learning in Systems Biology (MLSB) tenutosi a Montreal nel 2015.
Transport protein classification through structured prediction and multiple lernel learning
G. ValentiniSecondo
;
2015
Abstract
Membrane transport systems comprise roughly 10% of all proteins in a cell and play a critical role in many biological processes. Improving and expanding their classification is an important goal that can affect studies involving comparative and functional genomics, probing molecular mechanisms of diseases and metabolic processes, and searching new therapeutic targets and pharmacologically relevant transport proteins. In this context, a relevant classification problem is represented by the characterization of transport proteins according to the TC (Transporter Classification) data base (TCDB). In this work we introduce an integrative machine learning based approach that tries to consider all the above issues. To this end we propose a novel structured-output method able to explicitly consider the hierarchical relationships between TCDB categories. The proposed classifier exploits state-of-the-art Multiple Kernel Learning (MKL) strategies to integrate a very large set of features extracted from up-to-date databases and it is conceived to be applied virtually to any organism for the TCDB-wide and proteome-wide prediction of the categories of transporters.File | Dimensione | Formato | |
---|---|---|---|
mlcb2015.pdf
accesso aperto
Descrizione: Articolo principale
Tipologia:
Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione
74.34 kB
Formato
Adobe PDF
|
74.34 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.