We present a framework that implements a service selection process tailored for the definition of service-based data pipelines. The framework addresses the critical challenge of balancing data quality and data protection in distributed, service-based data pipelines, an issue that existing solutions overlook by treating these dimensions independently. By modeling the pipeline as a Directed Acyclic Graph (DAG), the framework extends pipelines with functional and data protection requirements. An extensive experimental evaluation measures the performance of our framework by analyzing variations in data quality across diverse datasets and configurations.

A Framework for Data Quality and Protection Management in Service-Based Data Pipelines / A. Polimeno, M. Luzzara, M. Anisetti, C.A. Ardagna, C. Ghedira-Guegan (PROCEEDINGS IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES). - In: ICWS[s.l] : Institute of Electrical and Electronics Engineers (IEEE), 2025 Jul. - ISBN 979-8-3315-5563-4. - pp. 258-268 (( convegno International Conference on Web Services : 07-12 July tenutosi a Helsinki (Finland) nel 2025 [10.1109/icws67624.2025.00040].

A Framework for Data Quality and Protection Management in Service-Based Data Pipelines

A. Polimeno
Primo
;
M. Luzzara
Secondo
;
M. Anisetti;C.A. Ardagna
Penultimo
;
2025

Abstract

We present a framework that implements a service selection process tailored for the definition of service-based data pipelines. The framework addresses the critical challenge of balancing data quality and data protection in distributed, service-based data pipelines, an issue that existing solutions overlook by treating these dimensions independently. By modeling the pipeline as a Directed Acyclic Graph (DAG), the framework extends pipelines with functional and data protection requirements. An extensive experimental evaluation measures the performance of our framework by analyzing variations in data quality across diverse datasets and configurations.
Benchmark; Distributed Services; Data Pipelines; Data Protection; Data quality;
Settore INFO-01/A - Informatica
lug-2025
Institute of Electrical and Electronics Engineers (IEEE)
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
Polimeno - ICWS 2025 Services.pdf

accesso aperto

Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Licenza: Creative commons
Dimensione 1.38 MB
Formato Adobe PDF
1.38 MB Adobe PDF Visualizza/Apri
A_Framework_for_Data_Quality_and_Protection_Management_in_Service-Based_Data_Pipelines(1).pdf

accesso riservato

Tipologia: Publisher's version/PDF
Licenza: Nessuna licenza
Dimensione 1.21 MB
Formato Adobe PDF
1.21 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1187775
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact