Explaining an answer to a Datalog query is an essential task towards Explainable AI, especially nowadays where Datalog plays a critical role in the development of ontology-based applications. A well-established approach for explaining a query answer is the so-called why-provenance, which essentially collects all the subsets of the input database that can be used to obtain that answer via some derivation process, typically represented as a proof tree. It is well known, however, that computing the why-provenance for Datalog queries is computationally expensive, and thus, very few attempts can be found in the literature. The goal of this work is to demonstrate how off-the-shelf SAT solvers can be exploited towards an efficient computation of the why-provenance for Datalog queries. Interestingly, our SAT-based approach allows us to build the why-provenance in an incremental fashion, that is, one explanation at a time, which is much more useful in a practical context than the one-shot computation of the whole set of explanations as done by existing approaches.

Computing the Why-Provenance for Datalog Queries via SAT Solvers / M. Calautti, E. Livshits, A. Pieris, M. Schneider (CEUR WORKSHOP PROCEEDINGS). - In: SEBD 2024 / [a cura di] M. Atzori, P. Ciacci, M. Ceci, F. Mandreoli, D. Malerba, M. Sanguinetti, A. Pellicani, F. Motta. - [s.l] : CEUR, 2024. - pp. 51-60 (( Intervento presentato al 32. convegno Symposium on Advanced Database Systems : June, 23rd to 26th tenutosi a Villasimius nel 2024.

Computing the Why-Provenance for Datalog Queries via SAT Solvers

M. Calautti
Primo
;
2024

Abstract

Explaining an answer to a Datalog query is an essential task towards Explainable AI, especially nowadays where Datalog plays a critical role in the development of ontology-based applications. A well-established approach for explaining a query answer is the so-called why-provenance, which essentially collects all the subsets of the input database that can be used to obtain that answer via some derivation process, typically represented as a proof tree. It is well known, however, that computing the why-provenance for Datalog queries is computationally expensive, and thus, very few attempts can be found in the literature. The goal of this work is to demonstrate how off-the-shelf SAT solvers can be exploited towards an efficient computation of the why-provenance for Datalog queries. Interestingly, our SAT-based approach allows us to build the why-provenance in an incremental fashion, that is, one explanation at a time, which is much more useful in a practical context than the one-shot computation of the whole set of explanations as done by existing approaches.
computational complexity; Datalog queries; explainability; why-provenance;
Settore INFO-01/A - Informatica
   Dynamic Disinformation Networks: Where is the Truth? (DISTORT)
   DISTORT
   MINISTERO DELL'UNIVERSITA' E DELLA RICERCA
   P2022KHTX7_003
2024
https://ceur-ws.org/Vol-3741/
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
C31 (SEBD 2024).pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 1.29 MB
Formato Adobe PDF
1.29 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1106292
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact