Real-life event logs are typically much less structured and more complex than the predefined business activities they refer to. Most of the existing process mining techniques assume that there is a one-to-one mapping between process model activities and events recorded during process execution. Unfortunately, event logs and process model activities are defined at different levels of granularity. The challenges posed by this discrepancy can be addressed by means of log-lifting. In this work, we develop a machine-learning-based framework aimed at bridging the abstraction level gap between logs and process models. The proposed framework operates in two main phases: log segmentation and machine-learning-based classification. The purpose of the segmentation phase is to identify the potential segment separators in a flow of low-level events, in which each segment corresponds to an unknown high-level activity. For this, we propose a segmentation algorithm based on maximum likelihood with n-gram analysis. In the second phase, event segments are mapped into their corresponding high-level activities using a supervised machine learning technique. Several machine learning classification methods are explored including ANNs, SVMs, and random forest. We demonstrate the applicability of our framework using a real-life event log provided by the SAP company. The results obtained show that a machine learning approach based on the random forest algorithm outperforms the other methods with an accuracy of 96.4%. The testing time was found to be around 0.01s, which makes the algorithm a good candidate for real-time deployment scenarios.

Machine Learning-Based Framework for Log-Lifting in Business Process Mining Applications / G. Tello, G. Gianini, R. Mizouni, E. Damiani (LECTURE NOTES IN ARTIFICIAL INTELLIGENCE). - In: Business Process Management[s.l] : Springer International Publishing, 2019. - ISBN 9783030266189. - pp. 232-249 (( Intervento presentato al 17. convegno International Conference on Business Process Management tenutosi a Wien nel 2019 [10.1007/978-3-030-26619-6_16].

Machine Learning-Based Framework for Log-Lifting in Business Process Mining Applications

G. Gianini
Secondo
;
E. Damiani
Ultimo
2019

Abstract

Real-life event logs are typically much less structured and more complex than the predefined business activities they refer to. Most of the existing process mining techniques assume that there is a one-to-one mapping between process model activities and events recorded during process execution. Unfortunately, event logs and process model activities are defined at different levels of granularity. The challenges posed by this discrepancy can be addressed by means of log-lifting. In this work, we develop a machine-learning-based framework aimed at bridging the abstraction level gap between logs and process models. The proposed framework operates in two main phases: log segmentation and machine-learning-based classification. The purpose of the segmentation phase is to identify the potential segment separators in a flow of low-level events, in which each segment corresponds to an unknown high-level activity. For this, we propose a segmentation algorithm based on maximum likelihood with n-gram analysis. In the second phase, event segments are mapped into their corresponding high-level activities using a supervised machine learning technique. Several machine learning classification methods are explored including ANNs, SVMs, and random forest. We demonstrate the applicability of our framework using a real-life event log provided by the SAP company. The results obtained show that a machine learning approach based on the random forest algorithm outperforms the other methods with an accuracy of 96.4%. The testing time was found to be around 0.01s, which makes the algorithm a good candidate for real-time deployment scenarios.
Process mining; Segmentation; Log-lifting; Machine learning
Settore INF/01 - Informatica
Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni
2019
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
2019 - Ghalia Tello - Machine Learning-Based Framework for Log-Lifting_PUBLISHED_VERSION.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 837.59 kB
Formato Adobe PDF
837.59 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/673293
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 21
  • ???jsp.display-item.citation.isi??? 9
social impact