The recent rapid advancements in both sensing and machine learning technologies have given rise to the universal collection and utilization of people’s biometrics, such as fingerprints, voices, retina/facial scans, or gait/motion/gestures data, enabling a wide range of applications including authentication, health monitoring, or much more sophisticated analytics. While providing better user experiences and deeper business insights, the use of biometrics has raised serious privacy concerns due to their intrinsic sensitive nature and the accompanying high risk of leaking sensitive information such as identity or medical conditions. In this paper, we propose a novel modality-agnostic data transformation framework that is capable of anonymizing biometric data by suppressing its sensitive attributes while retaining features relevant to downstream machine learning-based analyses that are of research and business values. We carried out a thorough experimental evaluation using publicly available facial, voice, motion, and EEG datasets. Results show that our proposed framework can achieve a high suppression level for sensitive information, while at the same time retain underlying data utility such that subsequent analyses on the anonymized biometric data could still be carried out to yield satisfactory accuracy.
Model-Agnostic Utility-Preserving Biometric Information Anonymization / C. Chen, B. Moriarty, S. Hu, S. Moran, M. Pistoia, V. Piuri, P. Samarati. - In: INTERNATIONAL JOURNAL OF INFORMATION SECURITY. - ISSN 1615-5270. - 23:(2024), pp. 2809-2826. [10.1007/s10207-024-00862-8]
Model-Agnostic Utility-Preserving Biometric Information Anonymization
V. Piuri;P. Samarati
2024
Abstract
The recent rapid advancements in both sensing and machine learning technologies have given rise to the universal collection and utilization of people’s biometrics, such as fingerprints, voices, retina/facial scans, or gait/motion/gestures data, enabling a wide range of applications including authentication, health monitoring, or much more sophisticated analytics. While providing better user experiences and deeper business insights, the use of biometrics has raised serious privacy concerns due to their intrinsic sensitive nature and the accompanying high risk of leaking sensitive information such as identity or medical conditions. In this paper, we propose a novel modality-agnostic data transformation framework that is capable of anonymizing biometric data by suppressing its sensitive attributes while retaining features relevant to downstream machine learning-based analyses that are of research and business values. We carried out a thorough experimental evaluation using publicly available facial, voice, motion, and EEG datasets. Results show that our proposed framework can achieve a high suppression level for sensitive information, while at the same time retain underlying data utility such that subsequent analyses on the anonymized biometric data could still be carried out to yield satisfactory accuracy.File | Dimensione | Formato | |
---|---|---|---|
2405.15062v1.pdf
accesso aperto
Tipologia:
Pre-print (manoscritto inviato all'editore)
Dimensione
766.43 kB
Formato
Adobe PDF
|
766.43 kB | Adobe PDF | Visualizza/Apri |
s10207-024-00862-8.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
1.38 MB
Formato
Adobe PDF
|
1.38 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.