Toward a General Framework for Multimodal Big Data Analysis

Bellandi, V.; Ceravolo, P.; Maghool, S.; Siccardi, S.

doi:10.1089/big.2021.0326

Multimodal analytics in Big Data architectures requires compound configurations of data processing tasks. Each data modality calls for specific analytics, which in turn trigger specific data processing tasks. Scalability comes at the cost of carefully calibrating the resources shared by the different tasks, seeking a trade-off among the multiple requirements they impose. We propose a methodology that addresses multimodal analytics within a single data processing approach, yielding a simplified architecture that can fully exploit the parallel processing of Big Data infrastructures. Multiple data sources are first integrated into a unified knowledge graph (KG). Different data modalities are addressed by specifying ad hoc views on the KG and producing a rewriting of the graph that contains only the data to be processed, which speeds up graph traversal and rule extraction. Using graph embedding methods, the different ad hoc views can be transformed into low-dimensional representations that share the same data format. A single machine learning procedure can thus address the different modalities, simplifying the architecture of our system. Our experiments show that the approach reduces the cost of execution and improves the accuracy of analytics.
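The pipeline outlined in the abstract (a unified KG, one view per modality, embeddings in a common format, one shared procedure) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the article: the toy triples, the predicate names `calls` and `located_in`, the helper names `view`, `embed` and `most_similar`, and above all the embedding itself, which is a simple neighbourhood feature-hashing scheme standing in for the graph embedding methods the authors actually use.

```python
import zlib
from collections import defaultdict

# Hypothetical toy knowledge graph: (subject, predicate, object) triples.
KG = [
    ("alice", "calls", "bob"),
    ("bob", "calls", "carol"),
    ("alice", "located_in", "milan"),
    ("carol", "located_in", "milan"),
    ("bob", "located_in", "rome"),
]

def view(triples, predicates):
    """Rewrite the graph so it keeps only the triples of one modality."""
    return [t for t in triples if t[1] in predicates]

def embed(triples, dim=8):
    """Toy embedding (assumption): hash each node's neighbours into `dim` buckets."""
    vectors = defaultdict(lambda: [0.0] * dim)
    for s, _, o in triples:
        # crc32 is used instead of hash() so the buckets are stable across runs.
        vectors[s][zlib.crc32(o.encode()) % dim] += 1.0
        vectors[o][zlib.crc32(s.encode()) % dim] += 1.0
    return dict(vectors)

def most_similar(vectors, node):
    """Shared downstream routine: nearest neighbour by cosine similarity."""
    def cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        na = sum(x * x for x in a) ** 0.5
        nb = sum(x * x for x in b) ** 0.5
        return dot / (na * nb) if na and nb else 0.0
    others = [n for n in vectors if n != node]
    return max(others, key=lambda n: cosine(vectors[node], vectors[n]))

# Two modality-specific views, embedded into vectors of the same length.
call_vectors = embed(view(KG, {"calls"}))
place_vectors = embed(view(KG, {"located_in"}))
```

Because both views yield vectors of the same length, `most_similar` runs unchanged on `call_vectors` and `place_vectors`. This mirrors, at toy scale, the claim that one learning procedure can serve several modalities once each view has been embedded in a common format.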

Toward a General Framework for Multimodal Big Data Analysis / V. Bellandi, P. Ceravolo, S. Maghool, S. Siccardi. - In: BIG DATA. - ISSN 2167-6461. - 10:5(2022), pp. 408-424. [10.1089/big.2021.0326]



Keywords: Big Data; big graph; data fusion; multimodal analysis
Scientific-disciplinary sector: INF/01 - Informatica (Computer Science)
Project title: Piano di Sostegno alla Ricerca 2015-2017 - Linea 2 "Dotazione annuale per attività istituzionali" (anno 2020)
Funder: Università degli Studi di Milano
Publication date: 2022
Ahead-of-print or print date: 14 Oct 2022
Journal (ANCE): BIG DATA
DOI: https://dx.doi.org/10.1089/big.2021.0326
URL: https://doi.org/10.1089/big.2021.0326
Type: Article (author)
Appears in types: 01 - Journal article

Files in this item:

big.2021.0326.pdf (Publisher's version/PDF, restricted access, 685.66 kB, Adobe PDF)


Use this identifier to cite or link to this document: https://hdl.handle.net/2434/943606
