High-performance prediction models for prostate cancer radiomics

Isaksson, L.J.; Repetto, M.; Summers, P.E.; Pepa, M.; Zaffaroni, M.; Vincini, M.G.; Corrao, G.; Mazzola, G.C.; Rotondi, M.; Bellerba, F.; Raimondi, S.; Haron, Z.; Alessi, S.; Pricolo, P.; Mistretta, F.A.; Luzzago, S.; Cattani, F.; Musi, G.; De Cobelli, O.; Cremonesi, M.; Orecchia, R.; La Torre, D.; Marvaso, G.; Petralia, G.; Jereczek-Fossa, B.A.

doi:10.1016/j.imu.2023.101161

When researchers are faced with building machine learning (ML) radiomic models, the first choice they have to make is what model to use. Naturally, the goal is to use the model with the best performance. But what is the best model? It is well known in ML that modern techniques such as gradient boosting and deep learning have better capacity than traditional models to solve complex problems in high dimensions. Despite this, most radiomics researchers still do not focus on these models in their research. As access to high-quality and large data sets increase, these high-capacity ML models may become even more relevant. In this article, we use a large dataset of 949 prostate cancer patients to compare the performance of a few of the most promising ML models for tabular data: gradient-boosted decision trees (GBDTs), multilayer perceptions, convolutional neural networks, and transformers. To this end, we predict nine different prostate cancer pathology outcomes of clinical interest. Our goal is to give a rough overview of how these models compare against one another in a typical radiomics setting. We also investigate if multitask learning improves the performance of these models when multiple targets are available. Our results suggest that GBDTs perform well across all targets, and that multitask learning does not provide a consistent improvement.

High-performance prediction models for prostate cancer radiomics / L.J. Isaksson, M. Repetto, P.E. Summers, M. Pepa, M. Zaffaroni, M.G. Vincini, G. Corrao, G.C. Mazzola, M. Rotondi, F. Bellerba, S. Raimondi, Z. Haron, S. Alessi, P. Pricolo, F.A. Mistretta, S. Luzzago, F. Cattani, G. Musi, O. De Cobelli, M. Cremonesi, R. Orecchia, D. La Torre, G. Marvaso, G. Petralia, B.A. Jereczek-Fossa. - In: INFORMATICS IN MEDICINE UNLOCKED. - ISSN 2352-9148. - 37:(2023), pp. 101161.1-101161.9. [10.1016/j.imu.2023.101161]

High-performance prediction models for prostate cancer radiomics

L.J. Isaksson^Primo;M. Repetto^Secondo;Summers P. E.;M. Pepa;M. Zaffaroni;Vincini M. G.;Corrao G.;G.C. Mazzola;M. Rotondi;F. Bellerba;Raimondi S.;Haron Z.;Alessi S.;P. Pricolo;F.A. Mistretta;S. Luzzago;Cattani F.;G. Musi;O. De Cobelli;Cremonesi M.;R. Orecchia;D. La Torre;G. Marvaso;G. Petralia^Penultimo;B.A. Jereczek-Fossa^Ultimo

2023

Abstract

When researchers are faced with building machine learning (ML) radiomic models, the first choice they have to make is what model to use. Naturally, the goal is to use the model with the best performance. But what is the best model? It is well known in ML that modern techniques such as gradient boosting and deep learning have better capacity than traditional models to solve complex problems in high dimensions. Despite this, most radiomics researchers still do not focus on these models in their research. As access to high-quality and large data sets increase, these high-capacity ML models may become even more relevant. In this article, we use a large dataset of 949 prostate cancer patients to compare the performance of a few of the most promising ML models for tabular data: gradient-boosted decision trees (GBDTs), multilayer perceptions, convolutional neural networks, and transformers. To this end, we predict nine different prostate cancer pathology outcomes of clinical interest. Our goal is to give a rough overview of how these models compare against one another in a typical radiomics setting. We also investigate if multitask learning improves the performance of these models when multiple targets are available. Our results suggest that GBDTs perform well across all targets, and that multitask learning does not provide a consistent improvement.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				Radiomics; Prostate cancer; Deep learning; Gradient boost
			
	Settori scientifico-disciplinari dell'articolo (sola visualizzazione)
	
				Settore MED/24 - Urologia
			
	Data di pubblicazione
	
				2023
			
	Rivista in ANCE
	
				INFORMATICS IN MEDICINE UNLOCKED
			
	DOI
	
				https://dx.doi.org/10.1016/j.imu.2023.101161
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

File	Dimensione	Formato
Musi Gennaro articolo.pdf accesso aperto Tipologia: Publisher's version/PDF Dimensione 1.94 MB Formato Adobe PDF Visualizza/Apri	1.94 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/953149

Citazioni

ND

7

ND

ND

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca