Comparing Performances of Predictive Models of Toxicity after Radiotherapy for Breast Cancer Using Different Machine Learning Approaches

Maria Giulia Ubeira-Gabellini,; Mori, M.; Palazzo, G.; Cicchetti, A.; Mangili, P.; Pavarini, M.; Rancati, T.; Fodor, A.; Antonella Del Vecchio,; Nadia Gisella Di Muzio,; Fiorino, C.

doi:10.3390/cancers16050934

Purpose. Different ML models were compared to predict toxicity in RT on a large cohort (n = 1314). Methods. The endpoint was RTOG G2/G3 acute toxicity, resulting in 204/1314 patients with the event. The dataset, including 25 clinical, anatomical, and dosimetric features, was split into 984 for training and 330 for internal tests. The dataset was standardized; features with a high p-value at univariate LR and with Spearman ρ > 0.8 were excluded; synthesized data of the minority were generated to compensate for class imbalance. Twelve ML methods were considered. Model optimization and sequential backward selection were run to choose the best models with a parsimonious feature number. Finally, feature importance was derived for every model. Results. The model’s performance was compared on a training–test dataset over different metrics: the best performance model was LightGBM. Logistic regression with three variables (LR3) selected via bootstrapping showed performances similar to the best-performing models. The AUC of test data is slightly above 0.65 for the best models (highest value: 0.662 with LightGBM). Conclusions. No model performed the best for all metrics: more complex ML models had better performances; however, models with just three features showed performances comparable to the best models using many (n = 13–19) features.

Comparing Performances of Predictive Models of Toxicity after Radiotherapy for Breast Cancer Using Different Machine Learning Approaches / M. Giulia Ubeira-Gabellini, M. Mori, G. Palazzo, A. Cicchetti, P. Mangili, M. Pavarini, T. Rancati, A. Fodor, A. del Vecchio, N. Gisella Di Muzio, C. Fiorino. - In: CANCERS. - ISSN 2072-6694. - 16:5(2024), pp. 934.1-934.24. [10.3390/cancers16050934]

Comparing Performances of Predictive Models of Toxicity after Radiotherapy for Breast Cancer Using Different Machine Learning Approaches

Maria Giulia Ubeira-Gabellini;M. Mori^{Secondo

Investigation};Gabriele Palazzo;Alessandro Cicchetti;Paola Mangili;Maddalena Pavarini;Tiziana Rancati;Andrei Fodor;Antonella del Vecchio;Nadia Gisella Di Muzio;Claudio Fiorino

2024

Abstract

Purpose. Different ML models were compared to predict toxicity in RT on a large cohort (n = 1314). Methods. The endpoint was RTOG G2/G3 acute toxicity, resulting in 204/1314 patients with the event. The dataset, including 25 clinical, anatomical, and dosimetric features, was split into 984 for training and 330 for internal tests. The dataset was standardized; features with a high p-value at univariate LR and with Spearman ρ > 0.8 were excluded; synthesized data of the minority were generated to compensate for class imbalance. Twelve ML methods were considered. Model optimization and sequential backward selection were run to choose the best models with a parsimonious feature number. Finally, feature importance was derived for every model. Results. The model’s performance was compared on a training–test dataset over different metrics: the best performance model was LightGBM. Logistic regression with three variables (LR3) selected via bootstrapping showed performances similar to the best-performing models. The AUC of test data is slightly above 0.65 for the best models (highest value: 0.662 with LightGBM). Conclusions. No model performed the best for all metrics: more complex ML models had better performances; however, models with just three features showed performances comparable to the best models using many (n = 13–19) features.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				AI models; early-stage breast cancer; modeling; radiotherapy; toxicity
			
	Settori scientifico-disciplinari dell'articolo (sola visualizzazione)
	
				Settore FIS/07 - Fisica Applicata(Beni Culturali, Ambientali, Biol.e Medicin)
			
	Titolo del progetto
	
	Titolo Progetto
	
									ERA-Net Cofund in Personalised Medicine
								
	Acronimo
	
									ERA PerMed
								
	Nome finanziatore
	
										European Commission
									
	Finanziamento
	
									Horizon 2020 Framework Programme
								
	N. Contratto
	
									779282
								
	Data di pubblicazione
	
				2024
			
	Rivista in ANCE
	
				CANCERS
			
	DOI
	
				https://dx.doi.org/10.3390/cancers16050934
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

File	Dimensione	Formato
Ubeira_Gabellini_Cancers 2024.pdf accesso aperto Descrizione: Article Tipologia: Publisher's version/PDF Dimensione 4.54 MB Formato Adobe PDF Visualizza/Apri	4.54 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1033301

Citazioni

ND

7

9

ND

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca