Metabolism prediction is a crucial step of drug development, as the biotransformations a drug candidate undergoes inside the human body can affect the clinical outcome. Computer-aided drug design has been extensively employed to speed up the process and enhance its efficiency and effectiveness, but among the investigated areas, metabolism has received less attention. This project aimed at leveraging machine learning to analyze large metabolic datasets, make predictions and recognize patterns, in order to fill this knowledge gap and enhance our understanding of metabolism and its impact on drug development. To achieve this goal, we developed a Deep Learning model for metabolism prediction using natural language processing techniques trained on molecular string representations, i.e., Simplified Molecular Input Line Entry Systems (SMILES) strings. To this end, we employ a Molecular Transformer, because of its ability to capture sequential and contextual information within strings (in this case, SMILES) enabling the learning of complex relationships. The transformer was trained using a high quality dataset, MetaQSAR, from which we derived approximately 100 000 instances of metabolic reactions. In this work, we investigate whether the Transformer architecture bears the potential to learn a mapping between the input molecular structures and their corresponding metabolites, in order to expedite drug discovery and improve patient safety.
Predicting Metabolic Reactions with a Molecular Transformer for Drug Design Optimization / S. Multari, R. Özçelik, A. Mazzolari, M.S. Nobile, F. Grisoni - In: CIBCB[s.l] : Institute of Electrical and Electronics Engineers (IEEE), 2024 Aug 29. - ISBN 9798350356632. - pp. 1-8 (( Intervento presentato al 21. convegno Conference on Computational Intelligence in Bioinformatics and Computational Biology : 27 through 29 August tenutosi a Natal (Brasil) nel 2024 [10.1109/CIBCB58642.2024.10702115].
Predicting Metabolic Reactions with a Molecular Transformer for Drug Design Optimization
A. Mazzolari;
2024
Abstract
Metabolism prediction is a crucial step of drug development, as the biotransformations a drug candidate undergoes inside the human body can affect the clinical outcome. Computer-aided drug design has been extensively employed to speed up the process and enhance its efficiency and effectiveness, but among the investigated areas, metabolism has received less attention. This project aimed at leveraging machine learning to analyze large metabolic datasets, make predictions and recognize patterns, in order to fill this knowledge gap and enhance our understanding of metabolism and its impact on drug development. To achieve this goal, we developed a Deep Learning model for metabolism prediction using natural language processing techniques trained on molecular string representations, i.e., Simplified Molecular Input Line Entry Systems (SMILES) strings. To this end, we employ a Molecular Transformer, because of its ability to capture sequential and contextual information within strings (in this case, SMILES) enabling the learning of complex relationships. The transformer was trained using a high quality dataset, MetaQSAR, from which we derived approximately 100 000 instances of metabolic reactions. In this work, we investigate whether the Transformer architecture bears the potential to learn a mapping between the input molecular structures and their corresponding metabolites, in order to expedite drug discovery and improve patient safety.| File | Dimensione | Formato | |
|---|---|---|---|
|
secondo conference paper IEEE_CIBCB_2024_paper_40.pdf
accesso aperto
Tipologia:
Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione
791.26 kB
Formato
Adobe PDF
|
791.26 kB | Adobe PDF | Visualizza/Apri |
|
Predicting_Metabolic_Reactions_with_a_Molecular_Transformer_for_Drug_Design_Optimization.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
815.71 kB
Formato
Adobe PDF
|
815.71 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




