A Framework for Data Quality and Protection Management in Service-Based Data Pipelines / A. Polimeno, M. Luzzara, M. Anisetti, C.A. Ardagna, C. Ghedira-Guegan. - In: ICWS (Proceedings IEEE International Conference on Web Services). - [s.l] : Institute of Electrical and Electronics Engineers (IEEE), 2025 Jul. - ISBN 979-8-3315-5563-4. - pp. 258-268. (International Conference on Web Services, held in Helsinki, Finland, 07-12 July 2025) [10.1109/icws67624.2025.00040].
A Framework for Data Quality and Protection Management in Service-Based Data Pipelines
A. Polimeno (first author); M. Luzzara (second author); M. Anisetti; C.A. Ardagna (penultimate author); C. Ghedira-Guegan
2025
Abstract
We present a framework that implements a service selection process tailored for the definition of service-based data pipelines. The framework addresses the critical challenge of balancing data quality and data protection in distributed, service-based data pipelines, an issue that existing solutions overlook by treating these dimensions independently. By modeling the pipeline as a Directed Acyclic Graph (DAG), the framework extends pipelines with functional and data protection requirements. An extensive experimental evaluation measures the performance of our framework by analyzing variations in data quality across diverse datasets and configurations.

| File | Access | Type | License | Size | Format |
|---|---|---|---|---|---|
| Polimeno - ICWS 2025 Services.pdf | Open access | Post-print, accepted manuscript, etc. (version accepted by the publisher) | Creative Commons | 1.38 MB | Adobe PDF |
| A_Framework_for_Data_Quality_and_Protection_Management_in_Service-Based_Data_Pipelines(1).pdf | Restricted access | Publisher's version/PDF | No license | 1.21 MB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.




