ScanDDM: Generalised Zero-Shot Neuro-Dynamical Modelling of Goal-Directed Attention

D'Amelio, A.; Lucchi, M.; Boccignone, G.

doi:10.1007/978-3-031-91578-9_17

This paper introduces ScanDDM, a novel scanpath model enabling generalised zero-shot goal-directed attention prediction. Leveraging recent advancements in vision-and-language learning techniques, ScanDDM models goal-directed attention by integrating high-level abstract concepts provided through textual prompts. The approach relies on a multialternative Drift Diffusion Model (DDM), framing gaze dynamics as a decision-making process that encapsulates both fixation duration and saccade execution. This allows to implement a value-based evidence accumulation process akin to the neurobiological mechanisms surmised to underlie human perceptual decision making. ScanDDM’s efficacy is quantitatively evaluated against the state-of-the-art model on the COCO-Search18 dataset, demonstrating excellent capabilities in predicting task-driven scanpaths in a zero-shot setting. Moreover, qualitative results showcase ScanDDM’s ability to generalize to complex and abstract concepts, beyond simple visual search tasks. Source code available at: https://github.com/phuselab/scanDDM.

ScanDDM: Generalised Zero-Shot Neuro-Dynamical Modelling of Goal-Directed Attention / A. D'Amelio, M. Lucchi, G. Boccignone (LECTURE NOTES IN COMPUTER SCIENCE). - In: Computer Vision – ECCV 2024 Workshops / [a cura di] A. Del Bue, C. Canton, J. Pont-Tuset, T. Tommasi. - [s.l] : Springer Science and Business Media Deutschland GmbH, 2025. - ISBN 9783031915772. - pp. 234-244 (( Intervento presentato al 18. convegno ECCV tenutosi a Milano nel 2024 [10.1007/978-3-031-91578-9_17].

ScanDDM: Generalised Zero-Shot Neuro-Dynamical Modelling of Goal-Directed Attention

A. D'Amelio^Primo;Lucchi M.;G. Boccignone^Ultimo

2025

Abstract

This paper introduces ScanDDM, a novel scanpath model enabling generalised zero-shot goal-directed attention prediction. Leveraging recent advancements in vision-and-language learning techniques, ScanDDM models goal-directed attention by integrating high-level abstract concepts provided through textual prompts. The approach relies on a multialternative Drift Diffusion Model (DDM), framing gaze dynamics as a decision-making process that encapsulates both fixation duration and saccade execution. This allows to implement a value-based evidence accumulation process akin to the neurobiological mechanisms surmised to underlie human perceptual decision making. ScanDDM’s efficacy is quantitatively evaluated against the state-of-the-art model on the COCO-Search18 dataset, demonstrating excellent capabilities in predicting task-driven scanpaths in a zero-shot setting. Moreover, qualitative results showcase ScanDDM’s ability to generalize to complex and abstract concepts, beyond simple visual search tasks. Source code available at: https://github.com/phuselab/scanDDM.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				Drift Diffusion Model; Human-inspired Attention Modelling; Scanpath Prediction; Visual Attention; Zero-Shot Visual Search
			
	Settori scientifico-disciplinari del contributo (validi dal 09/05/2024)
	
				Settore INFO-01/A - Informatica
			
	Data di pubblicazione
	
				2025
			
	DOI
	
				https://dx.doi.org/10.1007/978-3-031-91578-9_17
			
	Tipologia
	
				Book Part (author)
			
	Appare nelle tipologie:
	
				03 - Contributo in volume

File in questo prodotto:

File	Dimensione	Formato
978-3-031-91578-9_17 (3).pdf accesso riservato Tipologia: Publisher's version/PDF Licenza: Nessuna licenza Dimensione 10.24 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	10.24 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1173815

Citazioni

ND

0

0

0

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca