Although protein–RNA interactions are crucial for many biological processes, predicting their binding free energies (ΔG)is a challenging task due to limited available experimental data and the complexity of these interactions. To address this is-sue, we developed a machine learning–based model designed to predict energy-based scores for protein–RNA complex-es, called PANTHER Score. By applying a local-to-global approach, we proposed a methodology further subdivided into five steps: (1) We derived 87,117 pairwise local interaction energies from 331,744 MD-derived interactions across 46 curated protein–RNA complexes; (2) we trained ML models on pairwise interaction features to predict local interaction energies without performing MD simulations; (3) we integrated predicted local interaction energies using a local-to-global methodology, to compute model-specific PANTHER Score; (4) we evaluate model-specific PANTHER Score on an indepen-dent test set of seven complexes; and (5) we validated and selected the optimal model using an external stress set of 110 complexes with experimental ΔG values for implementation in the PANTHER Scoring pipeline. Among the regression models developed, Random Forest Regression exhibited the highest predictive performance as a model-specific PANTHER Score, achieveing a Pearson correlation (r) of 0.80 and MAE of 1.79 kcal/mol on the test set. It maintained strong predictive capabilities on the stress set (r = 0.64, MAE = 1.63 kcal/mol). Benchmarking against existing tools on the stress test set, the PANTHER Score demonstrated superior accuracy and reliability. This study highlights the effectiveness of MD and machine learning in addressing data limitations through innovative strategies, positioning the PANTHER Score as a robust tool for predicting protein–RNA binding affinities in biomolecular research, drug discovery and mainly in RNA-therapeutics.

PANTHER Score: Protein-Affinity for Nucleic Target-binding, Hybridization, and Energy Regression / P. Aletayeb, A.D. Biswas, S. Rocca, C. Talarico, G. Vistoli, A. Pedretti. - In: RNA. - ISSN 1355-8382. - 32:2(2026 Jan 16), pp. 131-149. [10.1261/rna.080646.125]

PANTHER Score: Protein-Affinity for Nucleic Target-binding, Hybridization, and Energy Regression

P. Aletayeb
Primo
Investigation
;
A.D. Biswas
Secondo
Conceptualization
;
S. Rocca
Data Curation
;
G. Vistoli
Penultimo
Writing – Original Draft Preparation
;
A. Pedretti
Ultimo
Funding Acquisition
2026

Abstract

Although protein–RNA interactions are crucial for many biological processes, predicting their binding free energies (ΔG)is a challenging task due to limited available experimental data and the complexity of these interactions. To address this is-sue, we developed a machine learning–based model designed to predict energy-based scores for protein–RNA complex-es, called PANTHER Score. By applying a local-to-global approach, we proposed a methodology further subdivided into five steps: (1) We derived 87,117 pairwise local interaction energies from 331,744 MD-derived interactions across 46 curated protein–RNA complexes; (2) we trained ML models on pairwise interaction features to predict local interaction energies without performing MD simulations; (3) we integrated predicted local interaction energies using a local-to-global methodology, to compute model-specific PANTHER Score; (4) we evaluate model-specific PANTHER Score on an indepen-dent test set of seven complexes; and (5) we validated and selected the optimal model using an external stress set of 110 complexes with experimental ΔG values for implementation in the PANTHER Scoring pipeline. Among the regression models developed, Random Forest Regression exhibited the highest predictive performance as a model-specific PANTHER Score, achieveing a Pearson correlation (r) of 0.80 and MAE of 1.79 kcal/mol on the test set. It maintained strong predictive capabilities on the stress set (r = 0.64, MAE = 1.63 kcal/mol). Benchmarking against existing tools on the stress test set, the PANTHER Score demonstrated superior accuracy and reliability. This study highlights the effectiveness of MD and machine learning in addressing data limitations through innovative strategies, positioning the PANTHER Score as a robust tool for predicting protein–RNA binding affinities in biomolecular research, drug discovery and mainly in RNA-therapeutics.
RNA-therapeutics; binding free energy (ΔG); machine learning models; pairwise interaction energies; predictive modeling; protein-RNA interactions;
Settore CHEM-07/A - Chimica farmaceutica
   National Center for Gene Therapy and Drugs based on RNA Technology (CN3 RNA)
   CN3 RNA
   MINISTERO DELL'UNIVERSITA' E DELLA RICERCA
   CN00000041
16-gen-2026
RNA
Article (author)
File in questo prodotto:
File Dimensione Formato  
RNA-2025-Aletayeb-rna.080646.125.pdf

accesso aperto

Tipologia: Pre-print (manoscritto inviato all'editore)
Licenza: Creative commons
Dimensione 2.83 MB
Formato Adobe PDF
2.83 MB Adobe PDF Visualizza/Apri
RNA-2026-Aletayeb-131-49.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Licenza: Nessuna licenza
Dimensione 9.61 MB
Formato Adobe PDF
9.61 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1242276
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact