A fundamental feature of the software process consists in its own stochastic in nature. A convenient approach for extracting the stochastic dynamics of a process from extended log data is that of modelling the process as a Markov Model, so that the discovery of the short/medium range dynamics of the process is cast in terms of the learning of Markov Models of different orders, or in terms of learning the corresponding transition matrices. In this paper we show that the use of a full Bayesian approach in the learning process helps providing robustness against statistical noise and over-fitting, as the size of a transition matrix grows exponentially with the order of the model. We give a specific model-model similarity definition and the corresponding calculation procedure to be used in model-to-sequence or sequence-to-sequence conformance assessment, which could also be applied to other inferential tasks, such as unsupervised process learning.

Discovering the software process by means of stochastic workflow analysis / Alberto Colombo, Ernesto Damiani, Gabriele Gianini. - In: JOURNAL OF SYSTEMS ARCHITECTURE. - ISSN 1383-7621. - 52:11(2006), pp. 684-692.

Discovering the software process by means of stochastic workflow analysis

Alberto Colombo;Ernesto Damiani;Gabriele Gianini
2006

Abstract

A fundamental feature of the software process consists in its own stochastic in nature. A convenient approach for extracting the stochastic dynamics of a process from extended log data is that of modelling the process as a Markov Model, so that the discovery of the short/medium range dynamics of the process is cast in terms of the learning of Markov Models of different orders, or in terms of learning the corresponding transition matrices. In this paper we show that the use of a full Bayesian approach in the learning process helps providing robustness against statistical noise and over-fitting, as the size of a transition matrix grows exponentially with the order of the model. We give a specific model-model similarity definition and the corresponding calculation procedure to be used in model-to-sequence or sequence-to-sequence conformance assessment, which could also be applied to other inferential tasks, such as unsupervised process learning.
Bayesian methods; Machine learning; Markov chains; Similarity measures; Software process; Stochastic dynamics; Time-sequence analysis; Workflow management
Settore INF/01 - Informatica
2006
Article (author)
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/29586
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 10
  • ???jsp.display-item.citation.isi??? 7
social impact