Genetic data are the most sensitive information for a person, containing many specific features that uniquely determine an individual and also make it possible to trace relationships with other people or evaluate the pre- disposition to particular diseases. For this reason, any processing of genetic data should be carefully performed and any threat to their privacy properly considered. A very important computation in medical and public health domains involves the evaluation of the edit distance between human genomes, that can eventually lead to a better diagnosis of several diseases. To maintain the privacy of the genetic data, it is possible to apply secure computation protocols and then, in this context, the improvement of the computational performance of such techniques is a key factor for real-world application scenarios. In this paper we focus on the application of the garbling circuit technique for the computation of the edit dis- tance, showing its efficiency. We apply the technique considering four different algorithms and compare their performances to the best previous results found in literature. We show that the Ukkonen algorithm with gener- alized cut-off is the one that performed better among the considered algorithms, reporting some experimental results obtained considering datasets composed of both randomly generated and real genomic strings
Efficient Secure Computation of Edit Distance on Genomic Data / A. Migliore, S. Cimato, G. Trucco - In: Proceedings of the ICISSP / [a cura di] G. Lenzini, P. Mori, S. Furnell. - [s.l] : Scitepress, 2024. - ISBN 9789897586835. - pp. 878-883 (( Intervento presentato al 10. convegno International Conference on Information Systems Security and Privacy : February 26-28 tenutosi a Roma nel 2024 [10.5220/0012459400003648].
Efficient Secure Computation of Edit Distance on Genomic Data
S. Cimato
;G. Trucco
2024
Abstract
Genetic data are the most sensitive information for a person, containing many specific features that uniquely determine an individual and also make it possible to trace relationships with other people or evaluate the pre- disposition to particular diseases. For this reason, any processing of genetic data should be carefully performed and any threat to their privacy properly considered. A very important computation in medical and public health domains involves the evaluation of the edit distance between human genomes, that can eventually lead to a better diagnosis of several diseases. To maintain the privacy of the genetic data, it is possible to apply secure computation protocols and then, in this context, the improvement of the computational performance of such techniques is a key factor for real-world application scenarios. In this paper we focus on the application of the garbling circuit technique for the computation of the edit dis- tance, showing its efficiency. We apply the technique considering four different algorithms and compare their performances to the best previous results found in literature. We show that the Ukkonen algorithm with gener- alized cut-off is the one that performed better among the considered algorithms, reporting some experimental results obtained considering datasets composed of both randomly generated and real genomic stringsFile | Dimensione | Formato | |
---|---|---|---|
ICISSP_2024_147_CR.pdf
accesso riservato
Tipologia:
Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione
163.66 kB
Formato
Adobe PDF
|
163.66 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
124594.pdf
accesso aperto
Tipologia:
Publisher's version/PDF
Dimensione
204.82 kB
Formato
Adobe PDF
|
204.82 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.