Real-life processes are typically less structured and more complex than expected by stakeholders. For this reason, process discovery techniques often deliver models less understandable and useful than expected. In order to address this issue, we propose a method based on statistical inference for pre-processing event logs. We measure the distance between different segments of the event log, computing the probability distribution of observing activities in specific positions. Because segments are generated based on time-domain, business rules or business management system properties, we get a characterisation of these segments in terms of both business and process aspects. We demonstrate the applicability of this approach by developing a case study with real-life event logs and showing that our method is offering interesting properties in term of computational complexity.
Toward a new generation of log pre-processing methods for process mining / P. Ceravolo, E. Damiani, M. Torabi, S. Barbon (LECTURE NOTES IN BUSINESS INFORMATION PROCESSING). - In: Business Process Management Forum / [a cura di] J. Carmona, G. Engels, A. Kumar. - [s.l] : Springer Verlag, 2017. - ISBN 9783319650142. - pp. 55-70 (( Intervento presentato al 15. convegno International Conference on Business Process Management tenutosi a Barcelona nel 2017 [10.1007/978-3-319-65015-9_4].
Toward a new generation of log pre-processing methods for process mining
P. CeravoloPrimo
;E. DamianiSecondo
;
2017
Abstract
Real-life processes are typically less structured and more complex than expected by stakeholders. For this reason, process discovery techniques often deliver models less understandable and useful than expected. In order to address this issue, we propose a method based on statistical inference for pre-processing event logs. We measure the distance between different segments of the event log, computing the probability distribution of observing activities in specific positions. Because segments are generated based on time-domain, business rules or business management system properties, we get a characterisation of these segments in terms of both business and process aspects. We demonstrate the applicability of this approach by developing a case study with real-life event logs and showing that our method is offering interesting properties in term of computational complexity.File | Dimensione | Formato | |
---|---|---|---|
main.pdf
accesso aperto
Tipologia:
Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione
588.12 kB
Formato
Adobe PDF
|
588.12 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.