Toward a General Framework for Multimodal Big Data Analysis / V. Bellandi, P. Ceravolo, S. Maghool, S. Siccardi. - In: BIG DATA. - ISSN 2167-6461. - 10:5(2022), pp. 408-424. [10.1089/big.2021.0326]

Toward a General Framework for Multimodal Big Data Analysis

V. Bellandi (first author); P. Ceravolo (second author); S. Maghool (penultimate author); S. Siccardi (last author)
2022

Abstract

Multimodal analytics in Big Data architectures implies compound configurations of data processing tasks. Each data modality requires specific analytics, which in turn trigger specific data processing tasks. Scalability comes at the cost of carefully calibrating the resources shared among the different tasks, seeking a trade-off among the multiple requirements they impose. We propose a methodology that addresses multimodal analytics within a single data processing approach, yielding a simplified architecture that can fully exploit the parallel processing of Big Data infrastructures. Multiple data sources are first integrated into a unified knowledge graph (KG). Different data modalities are addressed by specifying ad hoc views on the KG and producing a rewriting of the graph that contains only the data to be processed. This boosts graph traversal and rule extraction. Using graph embedding methods, the different ad hoc views can be transformed into low-dimensional representations that share the same data format. In this way, a single machine learning procedure can address the different modalities, simplifying the architecture of our system. Our experiments demonstrate that the approach reduces the cost of execution and improves the accuracy of analytics.
Big Data; big graph; data fusion; multimodal analysis
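The pipeline outlined in the abstract (unified KG, ad hoc views, embeddings in a common format, one downstream procedure) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the article: the toy triples, the helper names `view`, `embed` and `outlier_score`, the use of a Laplacian spectral embedding as the graph embedding method, and a distance-from-centroid score standing in for the machine learning procedure.

```python
import numpy as np

# Hypothetical toy KG: (subject, predicate, object) triples merged from two sources.
TRIPLES = [
    ("alice", "calls", "bob"),
    ("bob", "calls", "carol"),
    ("carol", "calls", "alice"),
    ("alice", "pays", "dave"),
    ("dave", "pays", "erin"),
    ("erin", "pays", "bob"),
]


def view(triples, predicates):
    """Ad hoc view: rewrite the KG keeping only triples with the given predicates."""
    return [t for t in triples if t[1] in predicates]


def embed(triples, dim=2):
    """Spectral embedding of the view, treated as an undirected graph.

    Assumes the view is connected. Returns the sorted node names and an
    array with one `dim`-dimensional row per node.
    """
    nodes = sorted({n for s, _, o in triples for n in (s, o)})
    index = {n: i for i, n in enumerate(nodes)}
    adj = np.zeros((len(nodes), len(nodes)))
    for s, _, o in triples:
        adj[index[s], index[o]] = adj[index[o], index[s]] = 1.0
    laplacian = np.diag(adj.sum(axis=1)) - adj
    _, vecs = np.linalg.eigh(laplacian)  # eigenvalues in ascending order
    # Skip the first (constant) eigenvector and keep the next `dim` ones.
    return nodes, vecs[:, 1:dim + 1]


def outlier_score(vectors):
    """Stand-in for the single downstream procedure: distance from the centroid."""
    return np.linalg.norm(vectors - vectors.mean(axis=0), axis=1)


# The same procedure consumes every view, because all views share one format.
for predicates in ({"calls"}, {"pays"}):
    nodes, vectors = embed(view(TRIPLES, predicates))
    scores = outlier_score(vectors)
    print(sorted(predicates), dict(zip(nodes, np.round(scores, 3))))
```

The point of the sketch is the shape of the data flow: each view yields an array with one fixed-width row per node, so the downstream function does not need to know which modality produced it.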
Sector INF/01 - Informatica (Computer Science)
Piano di Sostegno alla Ricerca 2015-2017 (Research Support Plan) - Line 2 "Annual allocation for institutional activities" (year 2020)
UNIVERSITA' DEGLI STUDI DI MILANO
2022
14 Oct 2022
https://doi.org/10.1089/big.2021.0326
Article (author)
Files in this item:
big.2021.0326.pdf (restricted access; type: Publisher's version/PDF; size: 685.66 kB; format: Adobe PDF)
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/943606