Automated Annotation of Sites of Metabolism from Biotransformation Data / R.A. Jacob, A. Mazzolari, J. Kirchmair. - In: JOURNAL OF CHEMICAL INFORMATION AND MODELING. - ISSN 1549-9596. - 65:13(2025 Jun 17), pp. 7065-7080. [10.1021/acs.jcim.5c00819]

Automated Annotation of Sites of Metabolism from Biotransformation Data

A. Mazzolari (penultimate author); 2025

Abstract

Computational models predicting the Sites-of-Metabolism (SOMs) of small organic molecules have become invaluable tools for studying and optimizing the metabolic properties of xenobiotics. However, the performance of SOM predictors has shown signs of plateauing in recent years, primarily due to the limited availability of training data. While vast amounts of biotransformation data in the form of substrate-metabolite pairs exist, their potential for SOM prediction remains largely untapped due to the absence of annotations. Annotating SOMs requires expert knowledge and is a highly time-consuming process. To address this challenge, we introduce AutoSOM, the first open-source tool that automatically extracts SOMs by mapping structural differences using transformation rules. AutoSOM is both fast and highly accurate, achieving over 90% labeling accuracy on a diverse validation set of more than 5,000 reactions within minutes. Moreover, its annotation process is fully transparent and interpretable, which we hope will facilitate its adoption in high-stakes downstream applications such as drug discovery campaigns and regulatory assessments. Beyond accelerating annotation, AutoSOM enables standardized and consistent SOM labeling across institutions without requiring direct data sharing.
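The abstract describes extracting SOMs by mapping structural differences between a substrate and its metabolite. The following toy sketch illustrates only that general idea and is not AutoSOM's algorithm or API. It assumes that the heavy atoms of both molecules already share atom-map numbers, and the dictionary layout and the function name `changed_atoms` are hypothetical. It flags substrate atoms whose bonding environment differs in the metabolite; cases where atoms are lost (for example dealkylations) need dedicated rules and are not handled here.

```python
from typing import Dict, FrozenSet, List, Set, Tuple

# Hypothetical toy representation: heavy atoms keyed by a shared atom-map number,
# bonds keyed by the unordered pair of atom-map numbers, valued by bond order.
Molecule = Dict[str, dict]


def environment(mol: Molecule, atom: int) -> List[Tuple[str, int]]:
    """Sorted (neighbour element, bond order) pairs around one atom."""
    env = []
    for pair, order in mol["bonds"].items():
        if atom in pair:
            (other,) = pair - {atom}
            env.append((mol["atoms"][other], order))
    return sorted(env)


def changed_atoms(substrate: Molecule, metabolite: Molecule) -> Set[int]:
    """Substrate atoms whose bonding environment differs in the metabolite."""
    changed = set()
    for atom in substrate["atoms"]:
        if atom not in metabolite["atoms"]:
            continue  # atoms lost in the reaction are ignored in this toy sketch
        if environment(substrate, atom) != environment(metabolite, atom):
            changed.add(atom)
    return changed


# Ethane -> ethanol (aliphatic hydroxylation), heavy atoms only.
ethane: Molecule = {
    "atoms": {1: "C", 2: "C"},
    "bonds": {frozenset({1, 2}): 1},
}
ethanol: Molecule = {
    "atoms": {1: "C", 2: "C", 3: "O"},
    "bonds": {frozenset({1, 2}): 1, frozenset({2, 3}): 1},
}

print(changed_atoms(ethane, ethanol))  # {2}: the carbon that gains the oxygen
```

In practice the atom mapping itself has to be derived from the structures, which is the hard part that this sketch takes as given.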
Sector CHEM-07/A - Pharmaceutical chemistry
17 Jun 2025
Article (author)
Files in this record:
File: automated-annotation-of-sites-of-metabolism-from-biotransformation-data.pdf
Access: open access
Type: Publisher's version/PDF
License: Creative Commons
Size: 3.11 MB
Format: Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/1242964