Graph-patrolling problems in the adversarial domain typically embed models and assumptions about how hostile events, from which an environment must be protected, are generated at a specific time and location. Relying upon such attacker models prevents algorithms from synthesizing strategies that can generalize in different settings, providing good performance under different and uncertain scenarios. In this paper, we propose a first method to deal with adversarial patrolling using a data driven approach. We cast the problem in an RL setting where the reward function is based on the ability to neutralize attacks that can follow an unknown strategy and that, hence, can be viewed as a black box component. We apply a policy gradient framework for optimizing action probabilities under such a reward model showing how effective patrolling strategies can be obtained from repeated attack-defense interactions between a patrolling agent and an attacker. Our results show that the data driven patroller can effectively provide protection against multiple, diverse attacker behaviors.
Learning Generalizable Patrolling Strategies through Domain Randomization of Attacker Behaviors / C.D. Alvarenga, N. Basilico, S. Carpin (IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION). - In: ICRA[s.l] : Institute of Electrical and Electronics Engineers (IEEE), 2024. - ISBN 979-8-3503-8458-1. - pp. 4406-4412 (( convegno International Conference on Robotics and Automation : May, 13 - 17 tenutosi a Yokohama nel 2024 [10.1109/ICRA57147.2024.10610052].
Learning Generalizable Patrolling Strategies through Domain Randomization of Attacker Behaviors
N. BasilicoPenultimo
;
2024
Abstract
Graph-patrolling problems in the adversarial domain typically embed models and assumptions about how hostile events, from which an environment must be protected, are generated at a specific time and location. Relying upon such attacker models prevents algorithms from synthesizing strategies that can generalize in different settings, providing good performance under different and uncertain scenarios. In this paper, we propose a first method to deal with adversarial patrolling using a data driven approach. We cast the problem in an RL setting where the reward function is based on the ability to neutralize attacks that can follow an unknown strategy and that, hence, can be viewed as a black box component. We apply a policy gradient framework for optimizing action probabilities under such a reward model showing how effective patrolling strategies can be obtained from repeated attack-defense interactions between a patrolling agent and an attacker. Our results show that the data driven patroller can effectively provide protection against multiple, diverse attacker behaviors.| File | Dimensione | Formato | |
|---|---|---|---|
|
icra24_b.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
546.22 kB
Formato
Adobe PDF
|
546.22 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




