External validation of radiomics-based predictive models in low-dose CT screening for early lung cancer diagnosis

Garau, N.; Paganelli, C.; Summers, P.; Choi, W.; Alam, S.; Lu, W.; Fanciullo, C.; Bellomi, M.; Baroni, G.; Rampinelli, C.

doi:10.1002/mp.14308

Purpose: Low-dose CT screening allows early lung cancer detection, but is affected by frequent false positive results, inter/intra observer variation and uncertain diagnoses of lung nodules. Radiomics-based models have recently been introduced to overcome these issues, but limitations in demonstrating their generalizability on independent datasets are slowing their introduction to clinic. The aim of this study is to evaluate two radiomics-based models to classify malignant pulmonary nodules in low-dose CT screening, and to externally validate them on an independent cohort. The effect of a radiomics features harmonization technique is also investigated to evaluate its impact on the classification of lung nodules from a multicenter data. Methods: Pulmonary nodules from two independent cohorts were considered in this study; the first cohort (110 subjects, 113 nodules) was used to train prediction models, and the second cohort (72 nodules) to externally validate them. Literature-based radiomics features were extracted and, after feature selection, used as predictive variables in models for malignancy identification. An in-house prediction model based on artificial neural network (ANN) was implemented and evaluated, along with an alternative model from the literature, based on a support vector machine (SVM) classifier coupled with a least absolute shrinkage and selection operator (LASSO). External validation was performed on the second cohort to evaluate models’ generalization ability. Additionally, the impact of the Combat harmonization method was investigated to compensate for multicenter datasets variabilities. A new training of the models based on harmonized features was performed on the first cohort, then tested separately on the harmonized and non-harmonized features of the second cohort. Results: Preliminary results showed a good accuracy of the investigated models in distinguishing benign from malignant pulmonary nodules with both sets of radiomics features (i.e., non-harmonized and harmonized). The performance of the models, quantified in terms of Area Under the Curve (AUC), was > 0.89 in the training set and > 0.82 in the external validation set for all the investigated scenarios, outperforming the clinical standard (AUC of 0.76). Slightly higher performance was observed for the SVM-LASSO model than the ANN in the external dataset, although they did not result significantly different. For both harmonized and non-harmonized features, no statistical difference was found between Receiver operating characteristic (ROC) curves related to training and test set for both models. Conclusions: Although no significant improvements were observed when applying the Combat harmonization method, both in-house and literature-based models were able to classify lung nodules with good generalization to an independent dataset, thus showing their potential as tools for clinical decision-making in lung cancer screening.difference was found between Receiver operating characteristic (ROC) curves related to training andtest set for both models.Conclusions:Although no significant improvements were observed when applying the Combat har-monization method, both in-house and literature-based models were able to classify lung noduleswith good generalization to an independent dataset, thus showing their potential as tools for clinicaldecision-making in lung cancer screening.

External validation of radiomics-based predictive models in low-dose CT screening for early lung cancer diagnosis / N. Garau, C. Paganelli, P. Summers, W. Choi, S. Alam, W. Lu, C. Fanciullo, M. Bellomi, G. Baroni, C. Rampinelli. - In: MEDICAL PHYSICS. - ISSN 0094-2405. - (2020). [Epub ahead of print] [10.1002/mp.14308]

External validation of radiomics-based predictive models in low-dose CT screening for early lung cancer diagnosis

Garau N.;C. Paganelli;Summers P.;Choi W.;Alam S.;Lu W.;C. Fanciullo;M. Bellomi;Baroni G.;Rampinelli C.

2020

Abstract

Purpose: Low-dose CT screening allows early lung cancer detection, but is affected by frequent false positive results, inter/intra observer variation and uncertain diagnoses of lung nodules. Radiomics-based models have recently been introduced to overcome these issues, but limitations in demonstrating their generalizability on independent datasets are slowing their introduction to clinic. The aim of this study is to evaluate two radiomics-based models to classify malignant pulmonary nodules in low-dose CT screening, and to externally validate them on an independent cohort. The effect of a radiomics features harmonization technique is also investigated to evaluate its impact on the classification of lung nodules from a multicenter data. Methods: Pulmonary nodules from two independent cohorts were considered in this study; the first cohort (110 subjects, 113 nodules) was used to train prediction models, and the second cohort (72 nodules) to externally validate them. Literature-based radiomics features were extracted and, after feature selection, used as predictive variables in models for malignancy identification. An in-house prediction model based on artificial neural network (ANN) was implemented and evaluated, along with an alternative model from the literature, based on a support vector machine (SVM) classifier coupled with a least absolute shrinkage and selection operator (LASSO). External validation was performed on the second cohort to evaluate models’ generalization ability. Additionally, the impact of the Combat harmonization method was investigated to compensate for multicenter datasets variabilities. A new training of the models based on harmonized features was performed on the first cohort, then tested separately on the harmonized and non-harmonized features of the second cohort. Results: Preliminary results showed a good accuracy of the investigated models in distinguishing benign from malignant pulmonary nodules with both sets of radiomics features (i.e., non-harmonized and harmonized). The performance of the models, quantified in terms of Area Under the Curve (AUC), was > 0.89 in the training set and > 0.82 in the external validation set for all the investigated scenarios, outperforming the clinical standard (AUC of 0.76). Slightly higher performance was observed for the SVM-LASSO model than the ANN in the external dataset, although they did not result significantly different. For both harmonized and non-harmonized features, no statistical difference was found between Receiver operating characteristic (ROC) curves related to training and test set for both models. Conclusions: Although no significant improvements were observed when applying the Combat harmonization method, both in-house and literature-based models were able to classify lung nodules with good generalization to an independent dataset, thus showing their potential as tools for clinical decision-making in lung cancer screening.difference was found between Receiver operating characteristic (ROC) curves related to training andtest set for both models.Conclusions:Although no significant improvements were observed when applying the Combat har-monization method, both in-house and literature-based models were able to classify lung noduleswith good generalization to an independent dataset, thus showing their potential as tools for clinicaldecision-making in lung cancer screening.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				low-dose CT screening; lung nodules classification; radiomics;
			
	Settori scientifico-disciplinari dell'articolo (sola visualizzazione)
	
				Settore MED/36 - Diagnostica per Immagini e Radioterapia
			
	Data di pubblicazione
	
				2020
			
	Data ahead of print o data di stampa
	
				giu-2020
			
	Rivista in ANCE
	
				MEDICAL PHYSICS
			
	DOI
	
				https://dx.doi.org/10.1002/mp.14308
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

File	Dimensione	Formato
mp.14308.pdf accesso riservato Tipologia: Publisher's version/PDF Dimensione 1.79 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.79 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/757445

Citazioni

7

37

34

ND

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca