PANTHER Score: Protein-Affinity for Nucleic Target-binding, Hybridization, and Energy Regression / P. Aletayeb, A.D. Biswas, S. Rocca, C. Talarico, G. Vistoli, A. Pedretti. - In: RNA. - ISSN 1355-8382. - 32:2(2026 Jan 16), pp. 131-149. [10.1261/rna.080646.125]
PANTHER Score: Protein-Affinity for Nucleic Target-binding, Hybridization, and Energy Regression
P. Aletayeb (first author; Investigation)
A.D. Biswas (second author; Conceptualization)
S. Rocca (Data Curation)
G. Vistoli (penultimate author; Writing – Original Draft Preparation)
A. Pedretti (last author; Funding Acquisition)
2026
Abstract
Although protein–RNA interactions are crucial for many biological processes, predicting their binding free energies (ΔG) is challenging because experimental data are limited and the interactions are complex. To address this issue, we developed a machine learning–based model, called PANTHER Score, that predicts energy-based scores for protein–RNA complexes. Following a local-to-global approach, the methodology comprises five steps: (1) we derived 87,117 pairwise local interaction energies from 331,744 MD-derived interactions across 46 curated protein–RNA complexes; (2) we trained ML models on pairwise interaction features to predict local interaction energies without performing MD simulations; (3) we integrated the predicted local interaction energies, using a local-to-global methodology, to compute a model-specific PANTHER Score; (4) we evaluated each model-specific PANTHER Score on an independent test set of seven complexes; and (5) we validated and selected the optimal model, for implementation in the PANTHER Scoring pipeline, using an external stress set of 110 complexes with experimental ΔG values. Among the regression models developed, Random Forest Regression exhibited the highest predictive performance as a model-specific PANTHER Score, achieving a Pearson correlation (r) of 0.80 and an MAE of 1.79 kcal/mol on the test set. It maintained strong predictive capability on the stress set (r = 0.64, MAE = 1.63 kcal/mol). When benchmarked against existing tools on the stress set, the PANTHER Score demonstrated superior accuracy and reliability. This study highlights the effectiveness of combining MD and machine learning to address data limitations, positioning the PANTHER Score as a robust tool for predicting protein–RNA binding affinities in biomolecular research, drug discovery and, in particular, RNA therapeutics.

| File | Access | Type | License | Size | Format |
|---|---|---|---|---|---|
| RNA-2025-Aletayeb-rna.080646.125.pdf | Open access | Pre-print (manuscript submitted to the publisher) | Creative Commons | 2.83 MB | Adobe PDF |
| RNA-2026-Aletayeb-131-49.pdf | Restricted access | Publisher's version/PDF | No license | 9.61 MB | Adobe PDF |
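
The performance figures quoted in the abstract are a Pearson correlation (r) and a mean absolute error (MAE) between predicted scores and experimental ΔG values. As a minimal sketch of how these two metrics are defined, the snippet below computes both in plain Python. The function names and the numbers are hypothetical placeholders chosen for illustration; they are not data or code from the paper.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of products of deviations from the means (unnormalised covariance).
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    # Square roots of the sums of squared deviations.
    dev_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    dev_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (dev_x * dev_y)


def mean_absolute_error(x, y):
    """Average absolute difference between paired values."""
    return sum(abs(a - b) for a, b in zip(x, y)) / len(x)


# Hypothetical placeholder values in kcal/mol, NOT taken from the paper.
experimental_dg = [-9.0, -10.5, -8.2, -11.3, -7.6]
predicted_score = [-8.1, -10.9, -9.0, -10.2, -8.4]

r = pearson_r(predicted_score, experimental_dg)
mae = mean_absolute_error(predicted_score, experimental_dg)
print(f"r = {r:.2f}, MAE = {mae:.2f} kcal/mol")
```

The two metrics answer different questions: r measures how well the predictions track the ranking and trend of the experimental values, while MAE measures the typical size of the error in kcal/mol. A model can score well on one and poorly on the other, which is why both are reported.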
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.




