Back to Search
Start Over
Learnable Cross-Scale Sparse Attention Guided Feature Fusion for UAV Object Detection
- Source :
- IEEE Access, Vol 12, Pp 114212-114226 (2024)
- Publication Year :
- 2024
- Publisher :
- IEEE, 2024.
-
Abstract
- Object detection in Unmanned Aerial Vehicle (UAVs) faces a significant challenge in computer vision. Traditional methods are difficult to model object appearance feature with large scale variations and viewpoint differences, when drones fly at different altitudes and capture images from diverse shooting angles. To address this issue, we propose a Learnable Cross-scale Sparse Attention (LCSA) guided feature fusion method to improve the performance of UAV object detection. Specifically, the LCSA feature fusion module enables each point in a feature map to aggregate discriminative information from a set of points with learnable offsets in neighbor feature maps. It enhances local discriminative features of the object by facilitating semantic information interaction across multiple feature maps. The LCSA can function as a novel neck method that complements the existing neck methods and is also transplantable to different object detection frameworks. Moreover, we also employ a scale-aware loss function to integrate the normalized Wasserstein distance with CIoU in order to improve the incompatibility of IoU for objects with large scale variance. Experimental results on the SeaDroneSeev2 and VisDrone2019-DET datasets show that the proposed method achieves superior performance. At a resolution of 640*640, our method achieves 81.9% AP50 and 47.4% AP on SeaDroneSeev2, surpassing baseline 4.9% and 4.8%, achieves state-of-the-art performance. Furthermore, our method outperforms baseline by 5% AP on VisDrone2019-DET. Code will be available at https://github.com/qch777/LSACF.
Details
- Language :
- English
- ISSN :
- 21693536
- Volume :
- 12
- Database :
- Directory of Open Access Journals
- Journal :
- IEEE Access
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.484e9c245bc844718a583cfac32b8246
- Document Type :
- article
- Full Text :
- https://doi.org/10.1109/ACCESS.2024.3444900