Machine learning models are routinely integrated into process mining pipelines to carry out tasks like data transformation, noise reduction, anomaly detection, classification, and prediction. Often, the design of such models is based on some ad-hoc assumptions about the corresponding data distributions, which are not necessarily in accordance with the non-parametric distributions typically observed with process data. Moreover, mainstream machine-learning approaches tend to ignore the challenges posed by concurrency in operational processes. Data encoding is a key element to smooth the mismatch between these assumptions but its potential is poorly exploited. In this paper, we argue that a deeper understanding of the challenges associated with training machine learning models on process data is essential for establishing a robust integration of process mining and machine learning. Our analysis aims to lay the groundwork for a methodology that aligns machine learning with process mining requirements. We encourage further research in this direction to advance the field and effectively address these critical issues.

Tuning Machine Learning to Address Process Mining Requirements / P. Ceravolo, S.B. Junior, E. Damiani, W. Van Der Aalst. - In: IEEE ACCESS. - ISSN 2169-3536. - 12:(2024 Feb 02), pp. 24583-24595. [10.1109/ACCESS.2024.3361650]

Tuning Machine Learning to Address Process Mining Requirements

P. Ceravolo
Primo
;
E. Damiani
Penultimo
;
2024

Abstract

Machine learning models are routinely integrated into process mining pipelines to carry out tasks like data transformation, noise reduction, anomaly detection, classification, and prediction. Often, the design of such models is based on some ad-hoc assumptions about the corresponding data distributions, which are not necessarily in accordance with the non-parametric distributions typically observed with process data. Moreover, mainstream machine-learning approaches tend to ignore the challenges posed by concurrency in operational processes. Data encoding is a key element to smooth the mismatch between these assumptions but its potential is poorly exploited. In this paper, we argue that a deeper understanding of the challenges associated with training machine learning models on process data is essential for establishing a robust integration of process mining and machine learning. Our analysis aims to lay the groundwork for a methodology that aligns machine learning with process mining requirements. We encourage further research in this direction to advance the field and effectively address these critical issues.
concurrency; encoding; machine learning; non-parametric distribution; non-stationary; Process mining; training; zero-shot learning; Generative Adversarial Networks; Latent Space; Business Processes; Executive Order; Cases Of Events; Event Log; Zero-shot; Concept Drift; Business Process Management; Concurrent Activation; Machine Learning Tasks; Basic Notions; Event Stream; Long Short-term Memory; Hyperparameter Tuning;
Settore INF/01 - Informatica
   MUSA - Multilayered Urban Sustainability Actiona
   MUSA
   MINISTERO DELL'UNIVERSITA' E DELLA RICERCA
2-feb-2024
https://ieeexplore.ieee.org/document/10418930/
Article (author)
File in questo prodotto:
File Dimensione Formato  
Tuning_Machine_Learning_to_Address_Process_Mining_Requirements.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 1.5 MB
Formato Adobe PDF
1.5 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1030130
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact