Statistical mechanical properties of sequence space determine the efficiency of the various algorithms to predict interaction energies and native contacts from protein coevolution / G. Franco, M. Cagiada, G. Bussi, G. Tiana. - In: PHYSICAL BIOLOGY. - ISSN 1478-3967. - 16:4(2019), pp. 046007.1-046007.14. [10.1088/1478-3975/ab1c15]

Statistical mechanical properties of sequence space determine the efficiency of the various algorithms to predict interaction energies and native contacts from protein coevolution

G. Tiana
2019

Abstract

Studying evolutionary correlations in alignments of homologous sequences by means of an inverse Potts model has proven useful for obtaining residue-residue contact energies and for predicting contacts in proteins. The quality of the results depends strongly on several choices in the detailed model and on the algorithms used. We built, in a highly controlled way, synthetic alignments with statistical properties similar to those of real proteins, and used them to assess the performance of different inversion algorithms and of their variants. Realistic synthetic alignments display typical features of low-temperature phases of disordered systems, a property that affects the inversion algorithms. We showed that a Boltzmann-learning algorithm is computationally feasible and performs well in predicting the energy of native contacts. However, all algorithms, when applied to alignments of realistic size, suffer from false positives to a similar degree, which makes the quality of native-contact prediction by the different algorithms strongly system-dependent.
mean field; pseudo likelihood; Boltzmann learning
Sector FIS/03 - Physics of Matter
Article (author)
Files in this record:
manuscript-3.pdf: open access; post-print, accepted manuscript (version accepted by the publisher); 1.99 MB; Adobe PDF
SuppMat.pdf: open access; post-print, accepted manuscript (version accepted by the publisher); 2.61 MB; Adobe PDF
Franco_2019_Phys._Biol._16_046007.pdf: restricted access; publisher's version/PDF; 1.57 MB; Adobe PDF
Use this identifier to cite or link to this document: https://hdl.handle.net/2434/647912