Proteins use the information stored in the genetic code and translated into their sequences to carry out well-defined functions in the cellular environment. The capacity to encode such functions is controlled by the balance between the amount of information supplied by the sequence and the amount left after the protein has folded into its structure. We study the amount of information necessary to specify the protein structure, providing an estimate that takes into account the thermodynamic properties of protein folding. We thus show that the information remaining in the protein sequence after encoding its structure (the 'information gap') is very close to what is needed to encode its function and interactions. Then, by predicting the information gap directly from the protein sequence, we show that it may be possible to use these insights from information theory to discriminate between ordered and disordered proteins, to identify unknown functions, and to optimize artificially designed protein sequences.
A method for partitioning the information contained in a protein sequence between its structure and function / A. Possenti, M. Vendruscolo, C. Camilloni, G. Tiana. - In: PROTEINS. - ISSN 0887-3585. - 86:9(2018 May 23), pp. 956-964.
|Title:||A method for partitioning the information contained in a protein sequence between its structure and function|
CAMILLONI, CARLO (Penultimate author) (Corresponding)
TIANA, GUIDO (Last author) (Corresponding)
|Keywords:||designed proteins; information content; intrinsically disordered proteins; protein folding/function; structure prediction|
|Scientific Disciplinary Sector:||Sector FIS/03 - Physics of Matter|
Sector FIS/07 - Applied Physics (Cultural Heritage, Environment, Biology and Medicine)
|Publication date:||23 May 2018|
|Digital Object Identifier (DOI):||http://dx.doi.org/10.1002/prot.25527|
|Appears in types:||01 - Journal article|