Learning Generalizable Patrolling Strategies through Domain Randomization of Attacker Behaviors

Alvarenga, C.D.; Basilico, N.; Carpin, S.

doi:10.1109/ICRA57147.2024.10610052

Graph-patrolling problems in the adversarial domain typically embed models and assumptions about how hostile events, from which an environment must be protected, are generated at a specific time and location. Relying upon such attacker models prevents algorithms from synthesizing strategies that can generalize in different settings, providing good performance under different and uncertain scenarios. In this paper, we propose a first method to deal with adversarial patrolling using a data driven approach. We cast the problem in an RL setting where the reward function is based on the ability to neutralize attacks that can follow an unknown strategy and that, hence, can be viewed as a black box component. We apply a policy gradient framework for optimizing action probabilities under such a reward model showing how effective patrolling strategies can be obtained from repeated attack-defense interactions between a patrolling agent and an attacker. Our results show that the data driven patroller can effectively provide protection against multiple, diverse attacker behaviors.

Learning Generalizable Patrolling Strategies through Domain Randomization of Attacker Behaviors / C.D. Alvarenga, N. Basilico, S. Carpin (IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION). - In: ICRA[s.l] : Institute of Electrical and Electronics Engineers (IEEE), 2024. - ISBN 979-8-3503-8458-1. - pp. 4406-4412 (( convegno International Conference on Robotics and Automation : May, 13 - 17 tenutosi a Yokohama nel 2024 [10.1109/ICRA57147.2024.10610052].

Learning Generalizable Patrolling Strategies through Domain Randomization of Attacker Behaviors

Alvarenga C. D.;N. Basilico^Penultimo;Carpin S.

2024

Abstract

Graph-patrolling problems in the adversarial domain typically embed models and assumptions about how hostile events, from which an environment must be protected, are generated at a specific time and location. Relying upon such attacker models prevents algorithms from synthesizing strategies that can generalize in different settings, providing good performance under different and uncertain scenarios. In this paper, we propose a first method to deal with adversarial patrolling using a data driven approach. We cast the problem in an RL setting where the reward function is based on the ability to neutralize attacks that can follow an unknown strategy and that, hence, can be viewed as a black box component. We apply a policy gradient framework for optimizing action probabilities under such a reward model showing how effective patrolling strategies can be obtained from repeated attack-defense interactions between a patrolling agent and an attacker. Our results show that the data driven patroller can effectively provide protection against multiple, diverse attacker behaviors.

Scheda breve

Scheda completa

Scheda completa (DC)

	Settori scientifico-disciplinari del contributo (validi dal 09/05/2024)
	
				Settore INFO-01/A - Informatica
Settore IINF-05/A - Sistemi di elaborazione delle informazioni
			
	Data di pubblicazione
	
				2024
			
	Enti collegati al convegno
	
				Institute of Electrical and Electronics Engineers (IEEE)
			
	DOI
	
				https://dx.doi.org/10.1109/ICRA57147.2024.10610052
			
	Tipologia
	
				Book Part (author)
			
	Appare nelle tipologie:
	
				03 - Contributo in volume

File in questo prodotto:

File	Dimensione	Formato
icra24_b.pdf accesso riservato Tipologia: Publisher's version/PDF Dimensione 546.22 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	546.22 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1156316

Citazioni

ND

1

0

ND

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca