Unmanned aerial vehicle (UAV) remote sensing images used for semantic segmentation possess distinct features compared to urban street scene images, including high resolution and a complex background. Spatial information plays a pivotal role in enhancing the performance of semantic segmentation for high-resolution images. The dual-branch architecture for semantic segmentation incorporates supplementary branches to capture spatial information. However, prior research on dual-branch semantic segmentation neglected the interaction between the contextual and spatial branches, leading to suboptimal model performance. In this discourse, the paper introduces a dual-branch semantic segmentation framework. This design advances the system's understanding of spatial information while facilitating inter-branch learning through two key modules. Initially, the spatial calibration feature extraction module employs frequency domain processing and learning tactics distinct from the contextual approach to generate image features under varied noise conditions. Calibration is achieved by generating features from diverse angles. Subsequently, the spatially-guided loss function directs the acquisition of spatial information for the spatial branch by condensing the deep image characteristics for the context branch. To assess the generalization capacity of the proposed method, experiments will be conducted on three different datasets. The proposed method's modules will be integrated into three representative dual-branch networks, allowing assessment of the generalization capacity of the key DBCG components. Empirical evidence demonstrates that this approach is highly effective, significantly surpassing the performance of the baseline network.
DBCG-Net: Dual Branch Calibration Guided Deep Network for UAV Images Semantic Segmentation / C. Mai, Y. Wu, Y. Zhai, H. Quan, J. Zhou, A. Genovese, V. Piuri, F. Scotti. - In: IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING. - ISSN 1939-1404. - 17:(2024 Mar 19), pp. 7932-7945. [10.1109/jstars.2024.3378695]
DBCG-Net: Dual Branch Calibration Guided Deep Network for UAV Images Semantic Segmentation
A. Genovese;V. PiuriPenultimo
;F. ScottiUltimo
2024
Abstract
Unmanned aerial vehicle (UAV) remote sensing images used for semantic segmentation possess distinct features compared to urban street scene images, including high resolution and a complex background. Spatial information plays a pivotal role in enhancing the performance of semantic segmentation for high-resolution images. The dual-branch architecture for semantic segmentation incorporates supplementary branches to capture spatial information. However, prior research on dual-branch semantic segmentation neglected the interaction between the contextual and spatial branches, leading to suboptimal model performance. In this discourse, the paper introduces a dual-branch semantic segmentation framework. This design advances the system's understanding of spatial information while facilitating inter-branch learning through two key modules. Initially, the spatial calibration feature extraction module employs frequency domain processing and learning tactics distinct from the contextual approach to generate image features under varied noise conditions. Calibration is achieved by generating features from diverse angles. Subsequently, the spatially-guided loss function directs the acquisition of spatial information for the spatial branch by condensing the deep image characteristics for the context branch. To assess the generalization capacity of the proposed method, experiments will be conducted on three different datasets. The proposed method's modules will be integrated into three representative dual-branch networks, allowing assessment of the generalization capacity of the key DBCG components. Empirical evidence demonstrates that this approach is highly effective, significantly surpassing the performance of the baseline network.File | Dimensione | Formato | |
---|---|---|---|
jstars24.pdf
accesso aperto
Tipologia:
Publisher's version/PDF
Dimensione
35.9 MB
Formato
Adobe PDF
|
35.9 MB | Adobe PDF | Visualizza/Apri |
jstars24(2)_compressed.pdf
accesso aperto
Tipologia:
Publisher's version/PDF
Dimensione
647.41 kB
Formato
Adobe PDF
|
647.41 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.