IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca

'Big Data' techniques are often adopted in cross-organization scenarios for integrating multiple data sources to extract statistics or other latent information. Even if these techniques do not require the support of a schema for processing data, a common conceptual model is typically defined to address name resolution. This implies that each local source is tasked of applying a semantic lifting procedure for expressing the local data in term of the common model. Semantic heterogeneity is then potentially introduced in data. In this paper we illustrate a methodology designed to the implementation of consistent process mining algorithms in a 'Big Data' context. In particular, we exploit two different procedures. The first one is aimed at computing the mismatch among the data sources to be integrated. The second uses mismatch values to extend data to be processed with a traditional map reduce algorithm.

Consistent process mining over big data triple stores / A. Azzini, P. Ceravolo - In: 2013 IEEE International congress on big data : 28 june – 3 july 2013, Santa Clara, California : proceedingsLos Alamitos : IEEE computer society, 2013. - ISBN 9780768550060. - pp. 54-61 (( convegno IEEE International Congress on Big Data tenutosi a Santa Clara, USA nel 2013.

Consistent process mining over big data triple stores

A. Azzini^Primo;P. Ceravolo^Ultimo

2013

Abstract

'Big Data' techniques are often adopted in cross-organization scenarios for integrating multiple data sources to extract statistics or other latent information. Even if these techniques do not require the support of a schema for processing data, a common conceptual model is typically defined to address name resolution. This implies that each local source is tasked of applying a semantic lifting procedure for expressing the local data in term of the common model. Semantic heterogeneity is then potentially introduced in data. In this paper we illustrate a methodology designed to the implementation of consistent process mining algorithms in a 'Big Data' context. In particular, we exploit two different procedures. The first one is aimed at computing the mismatch among the data sources to be integrated. The second uses mismatch values to extend data to be processed with a traditional map reduce algorithm.

Scheda breve

Scheda completa

Scheda completa (DC)

	Settori scientifico-disciplinari del contributo
	
			Settore INF/01 - Informatica
		
	Data di pubblicazione
	
			2013
		
	DOI
	
			https://dx.doi.org/10.1109/BigData.Congress.2013.17
		
	Tipologia
	
			Book Part (author)
		
	Appare nelle tipologie:
	
			03 - Contributo in volume

File in questo prodotto:

File	Dimensione	Formato
bigdata.pdf accesso aperto Tipologia: Pre-print (manoscritto inviato all'editore) Dimensione 600.99 kB Formato Adobe PDF Visualizza/Apri	600.99 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/224622

Citazioni

ND

30

12

social impact