Robust ML model ensembles via risk-driven anti-clustering of training data / L. Mauri, B. Apolloni, E. Damiani. - In: INFORMATION SCIENCES. - ISSN 0020-0255. - 633:(2023 Jul), pp. 122-140. [10.1016/j.ins.2023.03.085]
Robust ML model ensembles via risk-driven anti-clustering of training data
L. Mauri (first author); B. Apolloni; E. Damiani
2023
Abstract
In this paper, we improve the robustness of Machine Learning (ML) classifiers against training-time attacks by linking the risk of training data being tampered with to the redundancy in the ML model's design needed to prevent it. Our defense mechanism is directly applicable to classifiers' training data, without any knowledge of the specific ML model to be hardened. First, we compute the training data's proximity to the class separation surfaces, identified via a reference linear model. Each data point is associated with a risk index, which is used to partition the training set by an unsupervised technique. Then, we train a learner for each partition and combine the learners' outputs in an ensemble. Our method treats the protected ML classifier as a black box and is inherently robust to transfer attacks. Experiments show that, for data poisoning rates between 6 and 25 percent of the training set, our method is more robust than both the benchmarks and a monolithic version of the model trained on the whole training set. Our results make a convincing case for adopting training set partitioning and ensemble generation as a stage of ML models' development and deployment lifecycle.

File | Access | Type | Size | Format
---|---|---|---|---
Robust ML model ensembles via risk-driven anti-clustering of training data.pdf | Open access | Publisher's version/PDF | 2.14 MB | Adobe PDF
Documents in IRIS are protected by copyright, and all rights are reserved unless otherwise indicated.
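The pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' exact algorithm: it assumes a linear SVM as the reference linear model, uses the inverse margin as the risk index, and stands in for the paper's unsupervised anti-clustering with a simple round-robin assignment over risk-sorted points, so that every partition spans the full risk spectrum.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.svm import LinearSVC
from sklearn.tree import DecisionTreeClassifier

def risk_driven_ensemble(X, y, n_partitions=3):
    # 1. Reference linear model identifies the class separation surface.
    ref = LinearSVC(max_iter=5000).fit(X, y)
    # 2. Risk index: points close to the separating hyperplane (small
    #    margin) are the easiest targets for poisoning, so we take the
    #    inverse of the absolute margin as a stand-in risk score.
    margin = np.abs(ref.decision_function(X))
    risk = 1.0 / (margin + 1e-9)
    # 3. Anti-clustering stand-in: sort by risk and deal points out
    #    round-robin, spreading high- and low-risk points evenly
    #    across partitions.
    order = np.argsort(risk)
    partitions = [order[i::n_partitions] for i in range(n_partitions)]
    # 4. Train one learner per partition.
    return [DecisionTreeClassifier(random_state=0).fit(X[p], y[p])
            for p in partitions]

def predict_majority(learners, X):
    # 5. Combine the learners' outputs by majority vote.
    votes = np.stack([m.predict(X) for m in learners])
    return np.apply_along_axis(lambda v: np.bincount(v).argmax(), 0, votes)

# Usage on a synthetic binary classification task.
X, y = make_classification(n_samples=300, random_state=0)
models = risk_driven_ensemble(X, y, n_partitions=3)
preds = predict_majority(models, X)
```

Because each learner only ever sees its own partition, poisoned points are confined to a subset of the ensemble, and the majority vote can outvote the affected learners.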