Unmanned aerial vehicle (UAV) remote sensing images used for semantic segmentation possess distinct features compared to urban street scene images, including high resolution and a complex background. Spatial information plays a pivotal role in enhancing the performance of semantic segmentation for high-resolution images. The dual-branch architecture for semantic segmentation incorporates supplementary branches to capture spatial information. However, prior research on dual-branch semantic segmentation neglected the interaction between the contextual and spatial branches, leading to suboptimal model performance. In this discourse, the paper introduces a dual-branch semantic segmentation framework. This design advances the system's understanding of spatial information while facilitating inter-branch learning through two key modules. Initially, the spatial calibration feature extraction module employs frequency domain processing and learning tactics distinct from the contextual approach to generate image features under varied noise conditions. Calibration is achieved by generating features from diverse angles. Subsequently, the spatially-guided loss function directs the acquisition of spatial information for the spatial branch by condensing the deep image characteristics for the context branch. To assess the generalization capacity of the proposed method, experiments will be conducted on three different datasets. The proposed method's modules will be integrated into three representative dual-branch networks, allowing assessment of the generalization capacity of the key DBCG components. Empirical evidence demonstrates that this approach is highly effective, significantly surpassing the performance of the baseline network.

DBCG-Net: Dual Branch Calibration Guided Deep Network for UAV Images Semantic Segmentation / C. Mai, Y. Wu, Y. Zhai, H. Quan, J. Zhou, A. Genovese, V. Piuri, F. Scotti. - In: IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING. - ISSN 1939-1404. - 17:(2024 Mar 19), pp. 7932-7945. [10.1109/jstars.2024.3378695]

DBCG-Net: Dual Branch Calibration Guided Deep Network for UAV Images Semantic Segmentation

A. Genovese;V. Piuri
Penultimo
;
F. Scotti
Ultimo
2024

Abstract

Unmanned aerial vehicle (UAV) remote sensing images used for semantic segmentation possess distinct features compared to urban street scene images, including high resolution and a complex background. Spatial information plays a pivotal role in enhancing the performance of semantic segmentation for high-resolution images. The dual-branch architecture for semantic segmentation incorporates supplementary branches to capture spatial information. However, prior research on dual-branch semantic segmentation neglected the interaction between the contextual and spatial branches, leading to suboptimal model performance. In this discourse, the paper introduces a dual-branch semantic segmentation framework. This design advances the system's understanding of spatial information while facilitating inter-branch learning through two key modules. Initially, the spatial calibration feature extraction module employs frequency domain processing and learning tactics distinct from the contextual approach to generate image features under varied noise conditions. Calibration is achieved by generating features from diverse angles. Subsequently, the spatially-guided loss function directs the acquisition of spatial information for the spatial branch by condensing the deep image characteristics for the context branch. To assess the generalization capacity of the proposed method, experiments will be conducted on three different datasets. The proposed method's modules will be integrated into three representative dual-branch networks, allowing assessment of the generalization capacity of the key DBCG components. Empirical evidence demonstrates that this approach is highly effective, significantly surpassing the performance of the baseline network.
Autonomous aerial vehicles; Calibration; CNN; deep learning; dual-branch calibration guided network; Feature extraction; Remote sensing; semantic segmentation; Semantic segmentation; Semantics; Spatial resolution; UAVs;
Settore INF/01 - Informatica
Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni
19-mar-2024
Article (author)
File in questo prodotto:
File Dimensione Formato  
jstars24.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 35.9 MB
Formato Adobe PDF
35.9 MB Adobe PDF Visualizza/Apri
jstars24(2)_compressed.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 647.41 kB
Formato Adobe PDF
647.41 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1040772
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact