A fundamental feature of the software process consists in its own stochastic in nature. A convenient approach for extracting the stochastic dynamics of a process from extended log data is that of modelling the process as a Markov Model, so that the discovery of the short/medium range dynamics of the process is cast in terms of the learning of Markov Models of different orders, or in terms of learning the corresponding transition matrices. In this paper we show that the use of a full Bayesian approach in the learning process helps providing robustness against statistical noise and over-fitting, as the size of a transition matrix grows exponentially with the order of the model. We give a specific model-model similarity definition and the corresponding calculation procedure to be used in model-to-sequence or sequence-to-sequence conformance assessment, which could also be applied to other inferential tasks, such as unsupervised process learning.
Discovering the software process by means of stochastic workflow analysis / Alberto Colombo, Ernesto Damiani, Gabriele Gianini. - In: JOURNAL OF SYSTEMS ARCHITECTURE. - ISSN 1383-7621. - 52:11(2006), pp. 684-692.
Discovering the software process by means of stochastic workflow analysis
Alberto Colombo;Ernesto Damiani;Gabriele Gianini
2006
Abstract
A fundamental feature of the software process consists in its own stochastic in nature. A convenient approach for extracting the stochastic dynamics of a process from extended log data is that of modelling the process as a Markov Model, so that the discovery of the short/medium range dynamics of the process is cast in terms of the learning of Markov Models of different orders, or in terms of learning the corresponding transition matrices. In this paper we show that the use of a full Bayesian approach in the learning process helps providing robustness against statistical noise and over-fitting, as the size of a transition matrix grows exponentially with the order of the model. We give a specific model-model similarity definition and the corresponding calculation procedure to be used in model-to-sequence or sequence-to-sequence conformance assessment, which could also be applied to other inferential tasks, such as unsupervised process learning.Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.