Leveraging RAG for Privacy Violation Detection and Explainability

Locci, S.; Audrito, D.; Livraga, G.; Viviani, M.; Di Caro, L.

doi:10.1109/IJCNN64981.2025.11228403

In today’s digital landscape, users frequently share vast amounts of information, including confidential data, often without full awareness of the associated privacy risks. This scenario highlights the need for automated methods to identify sensitive information and alert users to such risks. Existing algorithmic solutions for detecting sensitive content typically require either human intervention (rule-based approaches) or labeled data (supervised learning), both of which can be costly and limiting. In this paper, we propose a framework based on Retrieval-Augmented Generation (RAG) to classify privacy-sensitive content while providing contextual explanations. We employed the state-of-the-art generative Large Language Model (LLM) GPT-4o, with Information Retrieval models BM25 and FAISS, enhancing both detection accuracy and explainability. Our method utilizes a curated Knowledge Base of scientific literature on privacy and confidentiality to retrieve contextually relevant information, which is then used to guide the classification process and generate explanations. Experimental evaluations on a real-world dataset (Enron Email Dataset) demonstrate that RAG-based approaches significantly outperform the zero-shot baseline, with BM25 showing the highest performance. This tool is designed to serve end-users, by mitigating risks before data sharing, by enabling proactive monitoring of privacy violations.

Leveraging RAG for Privacy Violation Detection and Explainability / S. Locci, D. Audrito, G. Livraga, M. Viviani, L. Di Caro - In: IJCNN2025[s.l] : Institute of Electrical and Electronics Engineers (IEEE), 2025 Nov. - ISBN 979-8-3315-1042-8. (( International Joint Conference on Neural Networks : June 30 - July 5 Roma 2025 [10.1109/IJCNN64981.2025.11228403].

Leveraging RAG for Privacy Violation Detection and Explainability

S. Locci;D. Audrito;G. Livraga;M. Viviani;L. Di Caro

2025

Abstract

In today’s digital landscape, users frequently share vast amounts of information, including confidential data, often without full awareness of the associated privacy risks. This scenario highlights the need for automated methods to identify sensitive information and alert users to such risks. Existing algorithmic solutions for detecting sensitive content typically require either human intervention (rule-based approaches) or labeled data (supervised learning), both of which can be costly and limiting. In this paper, we propose a framework based on Retrieval-Augmented Generation (RAG) to classify privacy-sensitive content while providing contextual explanations. We employed the state-of-the-art generative Large Language Model (LLM) GPT-4o, with Information Retrieval models BM25 and FAISS, enhancing both detection accuracy and explainability. Our method utilizes a curated Knowledge Base of scientific literature on privacy and confidentiality to retrieve contextually relevant information, which is then used to guide the classification process and generate explanations. Experimental evaluations on a real-world dataset (Enron Email Dataset) demonstrate that RAG-based approaches significantly outperform the zero-shot baseline, with BM25 showing the highest performance. This tool is designed to serve end-users, by mitigating risks before data sharing, by enabling proactive monitoring of privacy violations.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				Privacy; Retrieval-Augmented Generation (RAG); Large Language Models (LLMs); Information Retrieval (IR); Knowledge Bases (KBs)
			
	Settori scientifico-disciplinari del contributo (validi dal 09/05/2024)
	
				Settore INFO-01/A - Informatica
			
	Titolo del progetto
	
	Titolo Progetto
	
									Green responsibLe privACy preservIng dAta operaTIONs
								
	Acronimo
	
									GLACIATION
								
	Nome finanziatore
	
										EUROPEAN COMMISSION
									
	N. Contratto
	
									101070141
								
	Titolo Progetto
	
									KURAMi: Knowledge-based, explainable User empowerment in Releasing private data and Assessing Misinformation in online environments
								
	Acronimo
	
									KURAMI
								
	Nome finanziatore
	
										MINISTERO DELL'UNIVERSITA' E DELLA RICERCA
									
	N. Contratto
	
									20225WTRFN_003
								
	Titolo Progetto
	
									SEcurity and RIghts in the CyberSpace (SERICS)
								
	Acronimo
	
									SERICS
								
	Nome finanziatore
	
										MINISTERO DELL'UNIVERSITA' E DELLA RICERCA
									
	N. Contratto
	
									codice identificativo PE00000014
								
	Data di pubblicazione
	
				nov-2025
			
	Enti collegati al convegno
	
				Institute of Electrical and Electronics Engineers (IEEE)
International Neural Network Society
			
	DOI
	
				https://dx.doi.org/10.1109/IJCNN64981.2025.11228403
			
	Tipologia
	
				Book Part (author)
			
	Appare nelle tipologie:
	
				03 - Contributo in volume

File in questo prodotto:

File	Dimensione	Formato
lalvd-ijcnn2025.pdf accesso riservato Tipologia: Publisher's version/PDF Licenza: Nessuna licenza Dimensione 319.36 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	319.36 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1224157

Citazioni

ND

0

ND

ND

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca