Big Data does not only refer to a huge amount of diverse and heterogeneous data. It also points to the management of procedures, technologies, and competencies associated with the analysis of such data, with the aim of supporting high-quality decision making. There are, however, several obstacles to the effective management of a Big Data computation, such as data velocity, variety, and veracity, and technological complexity, which represent the main barriers towards the full adoption of the Big Data paradigm. The goal of this work is to define a new software Development Life Cycle for the design and implementation of a Big Data computation. Our proposal integrates two model-driven methods: a first method based on pre-configured services that reduces the cost of deployment and a second method based on custom component development that provides an incremental process of refinement and customization. The proposal is experimentally evaluated by clustering a data set of the distribution of the population in the United States based on contextual criteria.

A Fast and Incremental Development Life Cycle for Data Analytics as a Service / C.A. Ardagna, V. Bellandi, P. Ceravolo, E. Damiani, Di Martino Beniamino, S. D'Angelo, A. Esposito - In: 2018 IEEE International Congress on Big Data (BigData Congress)[s.l] : IEEE, 2018. - ISBN 9781538672327. - pp. 174-181 (( convegno International Congress on Big Data (IEEE BigData) Part of the IEEE World Congress on Services tenutosi a San Francisco nel 2018 [10.1109/BigDataCongress.2018.00030].

A Fast and Incremental Development Life Cycle for Data Analytics as a Service

C.A. Ardagna
;
V. Bellandi
;
P. Ceravolo
;
E. Damiani;
2018

Abstract

Big Data does not only refer to a huge amount of diverse and heterogeneous data. It also points to the management of procedures, technologies, and competencies associated with the analysis of such data, with the aim of supporting high-quality decision making. There are, however, several obstacles to the effective management of a Big Data computation, such as data velocity, variety, and veracity, and technological complexity, which represent the main barriers towards the full adoption of the Big Data paradigm. The goal of this work is to define a new software Development Life Cycle for the design and implementation of a Big Data computation. Our proposal integrates two model-driven methods: a first method based on pre-configured services that reduces the cost of deployment and a second method based on custom component development that provides an incremental process of refinement and customization. The proposal is experimentally evaluated by clustering a data set of the distribution of the population in the United States based on contextual criteria.
Big Data Analytics; Model-Driven Development; Software Development Life Cycle
Settore INF/01 - Informatica
   TrustwOrthy model-awaRE Analytics Data platfORm
   TOREADOR
   EUROPEAN COMMISSION
   H2020
   688797
2018
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
bigdata.pdf

accesso riservato

Tipologia: Pre-print (manoscritto inviato all'editore)
Dimensione 330.98 kB
Formato Adobe PDF
330.98 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
08457746.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 273.76 kB
Formato Adobe PDF
273.76 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/619765
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 2
social impact