Even though glucuronidations are the most frequent metabolic reactions of conjugation, both in quantitative and qualitative terms, they have rather seldom been investigated using computational approaches. To fill this gap, we have used the manually collected MetaQSAR metabolic reaction database to generate two models for the prediction of UGT-mediated metabolism, both based on molecular descriptors and implementing the Random Forest algorithm. The first model predicts the occurrence of the reaction and was internally validated with a Matthew correlation coefficient (MCC) of 0.76 and an area under the ROC curve (AUC) of 0.94, and further externally validated using a test set composed of 120 additional xenobiotics (MCC of 0.70 and AUC of 0.90). The second model distinguishes between O- and N-glucuronidations and was optimized by the random undersampling procedure to improve the predictive accuracy during the internal validation, with the recall measure of the minority class increasing from 0.55 to 0.78.

Prediction of UGT-mediated Metabolism Using the Manually Curated MetaQSAR Database / A. Mazzolari, A.M. Afzal, A. Pedretti, B. Testa, G. Vistoli, A. Bender. - In: ACS MEDICINAL CHEMISTRY LETTERS. - ISSN 1948-5875. - 10:4(2019 Apr 11), pp. 633-638.

Prediction of UGT-mediated Metabolism Using the Manually Curated MetaQSAR Database

A. Mazzolari
Primo
;
A. Pedretti;G. Vistoli
Penultimo
;
2019

Abstract

Even though glucuronidations are the most frequent metabolic reactions of conjugation, both in quantitative and qualitative terms, they have rather seldom been investigated using computational approaches. To fill this gap, we have used the manually collected MetaQSAR metabolic reaction database to generate two models for the prediction of UGT-mediated metabolism, both based on molecular descriptors and implementing the Random Forest algorithm. The first model predicts the occurrence of the reaction and was internally validated with a Matthew correlation coefficient (MCC) of 0.76 and an area under the ROC curve (AUC) of 0.94, and further externally validated using a test set composed of 120 additional xenobiotics (MCC of 0.70 and AUC of 0.90). The second model distinguishes between O- and N-glucuronidations and was optimized by the random undersampling procedure to improve the predictive accuracy during the internal validation, with the recall measure of the minority class increasing from 0.55 to 0.78.
glucuronidation; machine learning; Metabolism; predictive modeling; Random Forest; UGT-mediated metabolism
Settore CHIM/08 - Chimica Farmaceutica
11-apr-2019
12-feb-2019
Article (author)
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/651151
Citazioni
  • ???jsp.display-item.citation.pmc??? 3
  • Scopus 11
  • ???jsp.display-item.citation.isi??? 11
social impact