Quality assurance for automatically generated contours with additional deep learning

Isaksson, L.; Summers, P.; Bhalerao, A.; Gandini, S.; Raimondi, S.; Pepa, M.; Zaffaroni, M.; Corrao, G.; Mazzola, G.; Rotondi, M.; Lo Presti, G.; Haron, Z.; Alessi, S.; Pricolo, P.; Mistretta, F.; Luzzago, S.; Cattani, F.; Musi, G.; De Cobelli, O.; Cremonesi, M.; Orecchia, R.; Marvaso, G.; Petralia, G.; Jereczek-Fossa, B.

doi:10.1186/s13244-022-01276-7

Objective Deploying an automatic segmentation model in practice should require rigorous quality assurance (QA) and continuous monitoring of the model's use and performance, particularly in high-stakes scenarios such as healthcare. Currently, however, tools to assist with QA for such models are not available to AI researchers. In this work, we build a deep learning model that estimates the quality of automatically generated contours. Methods The model was trained to predict the segmentation quality by outputting an estimate of the Dice similarity coefficient given an image contour pair as input. Our dataset contained 60 axial T2-weighted MRI images of prostates with ground truth segmentations along with 80 automatically generated segmentation masks. The model we used was a 3D version of the EfficientDet architecture with a custom regression head. For validation, we used a fivefold cross-validation. To counteract the limitation of the small dataset, we used an extensive data augmentation scheme capable of producing virtually infinite training samples from a single ground truth label mask. In addition, we compared the results against a baseline model that only uses clinical variables for its predictions. Results Our model achieved a mean absolute error of 0.020 +/- 0.026 (2.2% mean percentage error) in estimating the Dice score, with a rank correlation of 0.42. Furthermore, the model managed to correctly identify incorrect segmentations (defined in terms of acceptable/unacceptable) 99.6% of the time. Conclusion We believe that the trained model can be used alongside automatic segmentation tools to ensure quality and thus allow intervention to prevent undesired segmentation behavior.

Quality assurance for automatically generated contours with additional deep learning / L. Isaksson, P. Summers, A. Bhalerao, S. Gandini, S. Raimondi, M. Pepa, M. Zaffaroni, G. Corrao, G. Mazzola, M. Rotondi, G. Lo Presti, Z. Haron, S. Alessi, P. Pricolo, F. Mistretta, S. Luzzago, F. Cattani, G. Musi, O. De Cobelli, M. Cremonesi, R. Orecchia, G. Marvaso, G. Petralia, B. Jereczek-Fossa. - In: INSIGHTS INTO IMAGING. - ISSN 1869-4101. - 13:(2022), pp. 137.1-137.10. [10.1186/s13244-022-01276-7]

Quality assurance for automatically generated contours with additional deep learning

L. Isaksson^Primo;Summers, P;Bhalerao, A;Gandini, S;Raimondi, S;Pepa, M;Zaffaroni, M;Corrao, G;Mazzola, GC;Rotondi, M;Lo Presti, G;Haron, Z;Alessi, S;Pricolo, P;F. Mistretta;S. Luzzago;Cattani, F;G. Musi;O. De Cobelli;Cremonesi, M;Orecchia, R;G. Marvaso;G. Petralia;B. Jereczek-Fossa^Ultimo

2022

Abstract

Objective Deploying an automatic segmentation model in practice should require rigorous quality assurance (QA) and continuous monitoring of the model's use and performance, particularly in high-stakes scenarios such as healthcare. Currently, however, tools to assist with QA for such models are not available to AI researchers. In this work, we build a deep learning model that estimates the quality of automatically generated contours. Methods The model was trained to predict the segmentation quality by outputting an estimate of the Dice similarity coefficient given an image contour pair as input. Our dataset contained 60 axial T2-weighted MRI images of prostates with ground truth segmentations along with 80 automatically generated segmentation masks. The model we used was a 3D version of the EfficientDet architecture with a custom regression head. For validation, we used a fivefold cross-validation. To counteract the limitation of the small dataset, we used an extensive data augmentation scheme capable of producing virtually infinite training samples from a single ground truth label mask. In addition, we compared the results against a baseline model that only uses clinical variables for its predictions. Results Our model achieved a mean absolute error of 0.020 +/- 0.026 (2.2% mean percentage error) in estimating the Dice score, with a rank correlation of 0.42. Furthermore, the model managed to correctly identify incorrect segmentations (defined in terms of acceptable/unacceptable) 99.6% of the time. Conclusion We believe that the trained model can be used alongside automatic segmentation tools to ensure quality and thus allow intervention to prevent undesired segmentation behavior.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				Quality assurance (Health care); Confidence calibration; Diagnostic imaging; Prostate; Magnetic resonance imaging
			
	Settori scientifico-disciplinari dell'articolo (sola visualizzazione)
	
				Settore MED/24 - Urologia
Settore MED/36 - Diagnostica per Immagini e Radioterapia
			
	Data di pubblicazione
	
				2022
			
	Data ahead of print o data di stampa
	
				17-ago-2022
			
	Rivista in ANCE
	
				INSIGHTS INTO IMAGING
			
	DOI
	
				https://dx.doi.org/10.1186/s13244-022-01276-7
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

File	Dimensione	Formato
s13244-022-01276-7.pdf accesso aperto Tipologia: Publisher's version/PDF Dimensione 1.35 MB Formato Adobe PDF Visualizza/Apri	1.35 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/948548

Citazioni

ND

10

8

Nome	Dominio	Durata	Descrizione
s_.*	plu.mx	sessione	recupero grafico citazioni sociali da plumx
A_.*	core.ac.uk	7 giorni	recupero pubblicazioni consigliate per il pannello core-recommander
GS_.*	gstatic.com	richiesta http	visualizza grafico citazioni
CC_.*	creativecommons.org	richiesta http	visualizza licenza bitstream

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca