Big Data does not only refer to a huge amount of diverse and heterogeneous data. It also points to the management of procedures, technologies, and competencies associated with the analysis of such data, with the aim of supporting high-quality decision making. There are, however, several obstacles to the effective management of a Big Data computation, such as data velocity, variety, and veracity, and technological complexity, which represent the main barriers towards the full adoption of the Big Data paradigm. The goal of this work is to define a new software Development Life Cycle for the design and implementation of a Big Data computation. Our proposal integrates two model-driven methods: a first method based on pre-configured services that reduces the cost of deployment and a second method based on custom component development that provides an incremental process of refinement and customization. The proposal is experimentally evaluated by clustering a data set of the distribution of the population in the United States based on contextual criteria.
A Fast and Incremental Development Life Cycle for Data Analytics as a Service / C.A. Ardagna, V. Bellandi, P. Ceravolo, E. Damiani, Di Martino Beniamino, S. D'Angelo, A. Esposito - In: 2018 IEEE International Congress on Big Data (BigData Congress)[s.l] : IEEE, 2018. - ISBN 9781538672327. - pp. 174-181 (( convegno International Congress on Big Data (IEEE BigData) Part of the IEEE World Congress on Services tenutosi a San Francisco nel 2018 [10.1109/BigDataCongress.2018.00030].
A Fast and Incremental Development Life Cycle for Data Analytics as a Service
C.A. Ardagna
;V. Bellandi
;P. Ceravolo
;E. Damiani;
2018
Abstract
Big Data does not only refer to a huge amount of diverse and heterogeneous data. It also points to the management of procedures, technologies, and competencies associated with the analysis of such data, with the aim of supporting high-quality decision making. There are, however, several obstacles to the effective management of a Big Data computation, such as data velocity, variety, and veracity, and technological complexity, which represent the main barriers towards the full adoption of the Big Data paradigm. The goal of this work is to define a new software Development Life Cycle for the design and implementation of a Big Data computation. Our proposal integrates two model-driven methods: a first method based on pre-configured services that reduces the cost of deployment and a second method based on custom component development that provides an incremental process of refinement and customization. The proposal is experimentally evaluated by clustering a data set of the distribution of the population in the United States based on contextual criteria.File | Dimensione | Formato | |
---|---|---|---|
bigdata.pdf
accesso riservato
Tipologia:
Pre-print (manoscritto inviato all'editore)
Dimensione
330.98 kB
Formato
Adobe PDF
|
330.98 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
08457746.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
273.76 kB
Formato
Adobe PDF
|
273.76 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.