Back to Search Start Over

Learnable Cross-Scale Sparse Attention Guided Feature Fusion for UAV Object Detection

Authors :
Xin Zuo
Chenhui Qi
Yifei Chen
Jifeng Shen
Heng Fan
Wankou Yang
Source :
IEEE Access, Vol 12, Pp 114212-114226 (2024)
Publication Year :
2024
Publisher :
IEEE, 2024.

Abstract

Object detection in Unmanned Aerial Vehicle (UAVs) faces a significant challenge in computer vision. Traditional methods are difficult to model object appearance feature with large scale variations and viewpoint differences, when drones fly at different altitudes and capture images from diverse shooting angles. To address this issue, we propose a Learnable Cross-scale Sparse Attention (LCSA) guided feature fusion method to improve the performance of UAV object detection. Specifically, the LCSA feature fusion module enables each point in a feature map to aggregate discriminative information from a set of points with learnable offsets in neighbor feature maps. It enhances local discriminative features of the object by facilitating semantic information interaction across multiple feature maps. The LCSA can function as a novel neck method that complements the existing neck methods and is also transplantable to different object detection frameworks. Moreover, we also employ a scale-aware loss function to integrate the normalized Wasserstein distance with CIoU in order to improve the incompatibility of IoU for objects with large scale variance. Experimental results on the SeaDroneSeev2 and VisDrone2019-DET datasets show that the proposed method achieves superior performance. At a resolution of 640*640, our method achieves 81.9% AP50 and 47.4% AP on SeaDroneSeev2, surpassing baseline 4.9% and 4.8%, achieves state-of-the-art performance. Furthermore, our method outperforms baseline by 5% AP on VisDrone2019-DET. Code will be available at https://github.com/qch777/LSACF.

Details

Language :
English
ISSN :
21693536
Volume :
12
Database :
Directory of Open Access Journals
Journal :
IEEE Access
Publication Type :
Academic Journal
Accession number :
edsdoj.484e9c245bc844718a583cfac32b8246
Document Type :
article
Full Text :
https://doi.org/10.1109/ACCESS.2024.3444900