Prediction of UGT-mediated Metabolism Using the Manually Curated MetaQSAR Database

Mazzolari, A.; Afzal, A.M.; Pedretti, A.; Testa, B.; Vistoli, G.; Bender, A.

doi:10.1021/acsmedchemlett.8b00603

Even though glucuronidations are the most frequent metabolic reactions of conjugation, both in quantitative and qualitative terms, they have rather seldom been investigated using computational approaches. To fill this gap, we have used the manually collected MetaQSAR metabolic reaction database to generate two models for the prediction of UGT-mediated metabolism, both based on molecular descriptors and implementing the Random Forest algorithm. The first model predicts the occurrence of the reaction and was internally validated with a Matthew correlation coefficient (MCC) of 0.76 and an area under the ROC curve (AUC) of 0.94, and further externally validated using a test set composed of 120 additional xenobiotics (MCC of 0.70 and AUC of 0.90). The second model distinguishes between O- and N-glucuronidations and was optimized by the random undersampling procedure to improve the predictive accuracy during the internal validation, with the recall measure of the minority class increasing from 0.55 to 0.78.

Prediction of UGT-mediated Metabolism Using the Manually Curated MetaQSAR Database / A. Mazzolari, A.M. Afzal, A. Pedretti, B. Testa, G. Vistoli, A. Bender. - In: ACS MEDICINAL CHEMISTRY LETTERS. - ISSN 1948-5875. - 10:4(2019 Apr 11), pp. 633-638.

Prediction of UGT-mediated Metabolism Using the Manually Curated MetaQSAR Database

A. Mazzolari^Primo;Afzal A. M.^Secondo;A. Pedretti;Testa B.;G. Vistoli^Penultimo;Bender A.^Ultimo

2019

Abstract

Even though glucuronidations are the most frequent metabolic reactions of conjugation, both in quantitative and qualitative terms, they have rather seldom been investigated using computational approaches. To fill this gap, we have used the manually collected MetaQSAR metabolic reaction database to generate two models for the prediction of UGT-mediated metabolism, both based on molecular descriptors and implementing the Random Forest algorithm. The first model predicts the occurrence of the reaction and was internally validated with a Matthew correlation coefficient (MCC) of 0.76 and an area under the ROC curve (AUC) of 0.94, and further externally validated using a test set composed of 120 additional xenobiotics (MCC of 0.70 and AUC of 0.90). The second model distinguishes between O- and N-glucuronidations and was optimized by the random undersampling procedure to improve the predictive accuracy during the internal validation, with the recall measure of the minority class increasing from 0.55 to 0.78.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				glucuronidation; machine learning; Metabolism; predictive modeling; Random Forest; UGT-mediated metabolism
			
	Settori scientifico-disciplinari dell'articolo (sola visualizzazione)
	
				Settore CHIM/08 - Chimica Farmaceutica
			
	Data di pubblicazione
	
				11-apr-2019
			
	Data ahead of print o data di stampa
	
				12-feb-2019
			
	Rivista in ANCE
	
				ACS MEDICINAL CHEMISTRY LETTERS
			
	DOI
	
				https://dx.doi.org/10.1021/acsmedchemlett.8b00603
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

Non ci sono file associati a questo prodotto.

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/651151

Citazioni

3

11

11

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca