In the Internet of Things (IoT) era, we need to face increased masses of cross-domain data stored in different formats (either relational, XML, JSON, textual) and data streams (produced by sensors), that can be highly or loosely structured and need to be integrated for analysis. Recently, many NoSQL systems (e.g., MongoDB, Cassandra, HBASE) have been born for coping the scalability issues of current analysis approaches. They all are based on different data models and present different performances based on structure, size and location of data distributed on clusters. While the research community is really active in the development of techniques for big data analysis, at current stage a comprehensive solution that supports the user in loading data from heterogeneous sources and in integrating them into the most suitable NoSQL system is still lacking. In this paper we propose the design and expected functionalities of Big Loader, a user-friendly loading system for NoSQL systems that allows the specification of the conceptual schema of the data to be loaded, the specification of the sources from which the data should be gathered in order to feed the conceptual schema and finally we outline intelligent strategies for the selection of the NoSQL system where the conceptual scheme can be deployed.

Towards a user-friendly loading system for the analysis of big data in the Internet of things / M. Mesiti, S. Valtolina - In: 2014 IEEE 38. Annual International computers, software and applications conference workshops : COMPSACW 2014 : 27–29 July 2014 , Västerås, Sweden / [a cura di] C.K. Chang, Y. Gao, A. Hurson, M. Matskin, B. McMillin, Y. Okabe, C. Seceleanu, K. Yoshida. - Los Alamitos, California : IEEE computer society, 2014 Jul 21. - ISBN 9781479935789. - pp. 312-317 (( Intervento presentato al 38. convegno Annual International Computers, Software and Applications Conference Workshops (COMPSACW) tenutosi a Vasteras nel 2014 [10.1109/COMPSACW.2014.55].

Towards a user-friendly loading system for the analysis of big data in the Internet of things

M. Mesiti
Primo
;
S. Valtolina
Ultimo
2014

Abstract

In the Internet of Things (IoT) era, we need to face increased masses of cross-domain data stored in different formats (either relational, XML, JSON, textual) and data streams (produced by sensors), that can be highly or loosely structured and need to be integrated for analysis. Recently, many NoSQL systems (e.g., MongoDB, Cassandra, HBASE) have been born for coping the scalability issues of current analysis approaches. They all are based on different data models and present different performances based on structure, size and location of data distributed on clusters. While the research community is really active in the development of techniques for big data analysis, at current stage a comprehensive solution that supports the user in loading data from heterogeneous sources and in integrating them into the most suitable NoSQL system is still lacking. In this paper we propose the design and expected functionalities of Big Loader, a user-friendly loading system for NoSQL systems that allows the specification of the conceptual schema of the data to be loaded, the specification of the sources from which the data should be gathered in order to feed the conceptual schema and finally we outline intelligent strategies for the selection of the NoSQL system where the conceptual scheme can be deployed.
data loader; NoSQL; data preparation for analysis
Settore INF/01 - Informatica
21-lug-2014
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
06903148.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 279.46 kB
Formato Adobe PDF
279.46 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/254427
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 19
  • ???jsp.display-item.citation.isi??? 6
social impact