Purpose: This study aimed to identify and preliminary validate distinct clusters of patients with13 cancer based on demographics, clinical characteristics, and symptoms and to inform future research14 on sample size requirements for achieving sufficient power in clustering analyses.15 Methods: This cross-sectional pilot study involved 114 patients with cancer from two hospitals in16 northern Italy. Data were collected on demographics, clinical characteristics, and 20 symptoms17 using the Edmonton Symptom Assessment System in October 2022. t-distributed stochastic18 neighbor embedding (t-SNE) was used to reduce the symptom data and demographics (e.g., age)19 into two components, which were then clustered using Ward’s method. A Monte Carlo simulation20 was conducted based on the t-SNE components to estimate the sample size needed to achieve 80! power for different cluster solutions (k = 2, 3, 4).22 Results: Two distinct clusters were identified: Cluster 1 (Higher Symptom Burden Cluster) and23 Cluster 2 (Lower Symptom Burden Cluster). Cluster 1 patients had a higher prevalence of24 depression, anxiety, and drowsiness. Monte Carlo simulations indicated that 50 patients per cluster25 were sufficient for k = 2 clusters to achieve 80% power, whereas 90 patients per cluster were26 needed for k = 3 clusters and 120 patients per cluster for k = 4 clusters.27 Conclusion: This study identified distinct patient clusters and provided preliminary evidence on the28 sample size required for clustering analyses in cancer research. Understanding patient clusters29 enables nurses to provide tailored interventions, potentially improving symptom management and30 overall patient care.

Patient clusters based on demographics, clinical characteristics, and cancer-related symptoms: a cross-sectional pilot study / G. Ghizzardi, S. Maiandi, D. Vasaturo, C. Collemi, A. Laurano, A. Magon, S. Belloni, D. Sidoli, C. Cascone, L.S. Bassani, S. Calvanese, R. Caruso. - In: EUROPEAN JOURNAL OF ONCOLOGY NURSING. - ISSN 1462-3889. - (2025), pp. 102796.1-102796.32. [Epub ahead of print] [10.1016/j.ejon.2025.102796]

Patient clusters based on demographics, clinical characteristics, and cancer-related symptoms: a cross-sectional pilot study

R. Caruso
Ultimo
Methodology
2025

Abstract

Purpose: This study aimed to identify and preliminary validate distinct clusters of patients with13 cancer based on demographics, clinical characteristics, and symptoms and to inform future research14 on sample size requirements for achieving sufficient power in clustering analyses.15 Methods: This cross-sectional pilot study involved 114 patients with cancer from two hospitals in16 northern Italy. Data were collected on demographics, clinical characteristics, and 20 symptoms17 using the Edmonton Symptom Assessment System in October 2022. t-distributed stochastic18 neighbor embedding (t-SNE) was used to reduce the symptom data and demographics (e.g., age)19 into two components, which were then clustered using Ward’s method. A Monte Carlo simulation20 was conducted based on the t-SNE components to estimate the sample size needed to achieve 80! power for different cluster solutions (k = 2, 3, 4).22 Results: Two distinct clusters were identified: Cluster 1 (Higher Symptom Burden Cluster) and23 Cluster 2 (Lower Symptom Burden Cluster). Cluster 1 patients had a higher prevalence of24 depression, anxiety, and drowsiness. Monte Carlo simulations indicated that 50 patients per cluster25 were sufficient for k = 2 clusters to achieve 80% power, whereas 90 patients per cluster were26 needed for k = 3 clusters and 120 patients per cluster for k = 4 clusters.27 Conclusion: This study identified distinct patient clusters and provided preliminary evidence on the28 sample size required for clustering analyses in cancer research. Understanding patient clusters29 enables nurses to provide tailored interventions, potentially improving symptom management and30 overall patient care.
Cancer-related symptoms; Patient clusters; Person-centered analyses; Patient34 subgroups; t-SNE; Hierarchical clustering; Monte Carlo simulation; Power analysis; Nursing35 practice
Settore MEDS-24/C - Scienze infermieristiche generali, cliniche, pediatriche e ostetrico-ginecologiche e neonatali
2025
gen-2025
Article (author)
File in questo prodotto:
File Dimensione Formato  
PIIS1462388925000201 (1).pdf

accesso riservato

Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione 3.47 MB
Formato Adobe PDF
3.47 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1138016
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact