Tell Machine Learning Potentials What They Are Needed For: Simulation-Oriented Training Exemplified for Glycine / F. Ge, R. Wang, C. Qu, P. Zheng, A. Nandi, R. Conte, P.L. Houston, J.M. Bowman, P.O. Dral. - In: THE JOURNAL OF PHYSICAL CHEMISTRY LETTERS. - ISSN 1948-7185. - 15:16(2024 Apr 25), pp. 4451-4460. [10.1021/acs.jpclett.4c00746]
Tell Machine Learning Potentials What They Are Needed For: Simulation-Oriented Training Exemplified for Glycine
R. Conte
2024
Abstract
Machine learning potentials (MLPs) are widely applied as an efficient alternative way to represent potential energy surfaces (PESs) in many chemical simulations. The MLPs are often evaluated with the root-mean-square errors on the test set drawn from the same distribution as the training data. Here, we systematically investigate the relationship between such test errors and the simulation accuracy with MLPs on an example of a full-dimensional, global PES for the glycine amino acid. Our results show that the errors in the test set do not unambiguously reflect the MLP performance in different simulation tasks, such as relative conformer energies, barriers, vibrational levels, and zero-point vibrational energies. We also offer an easily accessible solution for improving the MLP quality in a simulation-oriented manner, yielding the most precise relative conformer energies and barriers. This solution also passed the stringent test by diffusion Monte Carlo simulations.
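To make the distinction between a test-set error and a simulation-relevant error concrete, the following is a minimal sketch with made-up numbers. It does not use data from the paper and is not the authors' training procedure. The function names (`rmse`, `relative_energy_error`), the reference energies, and the two hypothetical models are all assumptions introduced only for illustration. It shows that a model with a lower RMSE can still give a worse relative energy between two points, because a relative energy depends on the difference of the errors at those points rather than on their average magnitude.

```python
import math


def rmse(pred, ref):
    """Root-mean-square error between predicted and reference energies."""
    return math.sqrt(sum((p - r) ** 2 for p, r in zip(pred, ref)) / len(ref))


def relative_energy_error(pred, ref, i, j):
    """Error in the energy difference E[i] - E[j] (e.g. conformer i vs. conformer j)."""
    return (pred[i] - pred[j]) - (ref[i] - ref[j])


# Made-up reference energies in arbitrary units (illustrative only).
ref = [0.0, 1.2, 2.5, 4.0]

# Hypothetical model A: every energy is shifted by the same constant.
model_a = [e + 1.0 for e in ref]

# Hypothetical model B: smaller errors, but with alternating sign.
model_b = [e + s for e, s in zip(ref, [0.5, -0.5, 0.5, -0.5])]

rmse_a = rmse(model_a, ref)
rmse_b = rmse(model_b, ref)

# Error in the energy of point 1 relative to point 0 for each model.
rel_a = relative_energy_error(model_a, ref, 1, 0)
rel_b = relative_energy_error(model_b, ref, 1, 0)

print(f"model A: RMSE = {rmse_a:.3f}, relative-energy error = {rel_a:.3f}")
print(f"model B: RMSE = {rmse_b:.3f}, relative-energy error = {rel_b:.3f}")
```

In this toy case model B has half the RMSE of model A (about 0.5 versus 1.0), yet its relative-energy error has a magnitude of about 1.0, whereas the constant shift in model A cancels out. This is only a schematic illustration of why a single test-set RMSE need not rank models the same way a specific simulation task would.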