Group-split attention network for crowd counting

Zhai, WZ; Gao, ML; Anisetti, M; Li, QL; Jeon, S; Pan, JF

doi:10.1117/1.JEI.31.4.041214

Crowd counting is an important yet challenging task in intelligent video surveillance and urban security systems. Its performance has improved significantly with the rise of convolutional neural networks (CNNs). However, accurate and efficient crowd counting in congested scenes remains under-explored because of scale variation and cluttered backgrounds. To address these problems, we propose a biologically inspired crowd counting method named the group-split attention network (GSANet). GSANet consists of three principal modules: the group-split (GS) module, the dual-aware attention module, and the aggregation module. The GS module splits the input feature map into groups and processes the subfeatures of each group in parallel, which reduces the computational cost. The dual-aware attention module combines spatial and channel information to reduce estimation errors in background regions. The aggregation module adopts a learning-based cross-group strategy to fuse feature maps along different channel dimensions. Extensive experiments on five benchmark crowd datasets demonstrate that GSANet achieves superior performance in terms of accuracy and efficiency.
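The three-module data flow described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract gives no layer-level details, so the function names (`group_split`, `dual_attention`, `aggregate`, `gsa_block`), the pooling-plus-sigmoid gates, and the group-mixing matrix `mix` are all assumptions chosen only to show the general idea of splitting channels into groups, gating each group along the channel and spatial dimensions, and fusing the groups again.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def group_split(x, groups):
    """Split a (C, H, W) feature map into `groups` sub-features along channels."""
    c, h, w = x.shape
    if c % groups != 0:
        raise ValueError("channel count must be divisible by groups")
    # Result has shape (G, C // G, H, W); the leading axis lets every group
    # be processed at once by vectorized operations.
    return x.reshape(groups, c // groups, h, w)


def dual_attention(sub):
    """Gate (G, Cg, H, W) sub-features along channels, then along space.

    Assumption: both gates are parameter-free (average pooling + sigmoid).
    A trained network would use learned layers here.
    """
    # Channel gate: one weight per channel from global average pooling.
    channel_gate = sigmoid(sub.mean(axis=(2, 3), keepdims=True))  # (G, Cg, 1, 1)
    sub = sub * channel_gate
    # Spatial gate: one weight per location from the mean over channels.
    spatial_gate = sigmoid(sub.mean(axis=1, keepdims=True))       # (G, 1, H, W)
    return sub * spatial_gate


def aggregate(sub, mix):
    """Fuse groups with a (G, G) mixing matrix, then restore the (C, H, W) layout.

    Assumption: `mix` stands in for the learned cross-group weights.
    """
    g, cg, h, w = sub.shape
    fused = np.einsum("gk,kchw->gchw", mix, sub)  # each output group mixes all groups
    return fused.reshape(g * cg, h, w)


def gsa_block(x, groups, mix):
    """Hypothetical end-to-end block: split, attend, aggregate."""
    return aggregate(dual_attention(group_split(x, groups)), mix)
```

As a usage example under the same assumptions, `gsa_block(x, groups=4, mix=np.eye(4))` applied to an array `x` of shape `(8, 4, 4)` returns an array of the same shape. With the identity matrix as `mix`, the groups are not mixed and the output is simply the attention-gated input. In a trained model, `mix` and the gates would be learned parameters.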

Group-split attention network for crowd counting / W. Zhai, M. Gao, M. Anisetti, Q. Li, S. Jeon, J. Pan. - In: JOURNAL OF ELECTRONIC IMAGING. - ISSN 1017-9909. - 31:4(2022 Jul 01), pp. 1-10. [10.1117/1.JEI.31.4.041214]



Keywords: crowd counting; convolutional neural network; attention mechanism; computer vision

Scientific-disciplinary sectors of the article (display only): Sector INF/01 - Computer Science

Publication date: 1 July 2022

Journal in ANCE: JOURNAL OF ELECTRONIC IMAGING

DOI: https://dx.doi.org/10.1117/1.JEI.31.4.041214

Type: Article (author)

Files in this item:

File	Access	Type	Size	Format
Group-split attention network for crowd counting.pdf	Restricted access	Publisher's version/PDF	6.34 MB	Adobe PDF


Documents in IRIS are protected by copyright, and all rights are reserved unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/938556
