In this work we describe the application of a careflow mining algorithm to detect the most frequent patterns of care in a cohort of 3000 breast cancer patients. The applied method relies on longitudinal data extracted from electronic health records, recorded from the first surgical procedure after a breast cancer diagnosis. Careflows are mined from events data recorded for administrative purposes, including procedures from ICD9 – CM billing codes and chemotherapy treatments. Events data have been pre-processed with Topic Modelling to create composite events based on concurrent procedures. The results of the careflow mining algorithm allow the discovery of electronic temporal phenotypes across the studied population. These phenotypes are further characterized on the basis of clinical traits and tumour histopathology, as well as in terms of relapses, metastasis occurrence and 5-year survival rates. Results are highly significant from a clinical perspective, since phenotypes describe well characterized pathology classes, and the careflows are well matched with existing clinical guidelines. The analysis thus facilitates deriving real-world evidence that can inform clinicians as well as hospital decision makers.

Mining post-surgical care processes in breast cancer patients / L. Chiudinelli, A. Dagliati, V. Tibollo, S. Albasini, N. Geifman, N. Peek, J.H. Holmes, F. Corsi, R. Bellazzi, L. Sacchi. - In: ARTIFICIAL INTELLIGENCE IN MEDICINE. - ISSN 0933-3657. - 105(2020 May). [10.1016/j.artmed.2020.101855]

Mining post-surgical care processes in breast cancer patients

F. Corsi;
2020

Abstract

In this work we describe the application of a careflow mining algorithm to detect the most frequent patterns of care in a cohort of 3000 breast cancer patients. The applied method relies on longitudinal data extracted from electronic health records, recorded from the first surgical procedure after a breast cancer diagnosis. Careflows are mined from events data recorded for administrative purposes, including procedures from ICD9 – CM billing codes and chemotherapy treatments. Events data have been pre-processed with Topic Modelling to create composite events based on concurrent procedures. The results of the careflow mining algorithm allow the discovery of electronic temporal phenotypes across the studied population. These phenotypes are further characterized on the basis of clinical traits and tumour histopathology, as well as in terms of relapses, metastasis occurrence and 5-year survival rates. Results are highly significant from a clinical perspective, since phenotypes describe well characterized pathology classes, and the careflows are well matched with existing clinical guidelines. The analysis thus facilitates deriving real-world evidence that can inform clinicians as well as hospital decision makers.
Breast cancer; Electronic Health Records; Latent Dirichlet Allocation; Process Mining; Temporal Data Analytics; Temporal Electronic Phenotyping; Topic Modelling
Settore MED/18 - Chirurgia Generale
mag-2020
15-apr-2020
Article (author)
File in questo prodotto:
File Dimensione Formato  
Mining.pdf

Open Access dal 14/04/2021

Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione 1.37 MB
Formato Adobe PDF
1.37 MB Adobe PDF Visualizza/Apri
1-s2.0-S0933365719306682-main.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 6.41 MB
Formato Adobe PDF
6.41 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/751260
Citazioni
  • ???jsp.display-item.citation.pmc??? 3
  • Scopus 18
  • ???jsp.display-item.citation.isi??? 12
social impact