1. Lightweight Oriented Object Detection Using Multiscale Context and Enhanced Channel Attention in Remote Sensing Images
- Author
-
Yuanfeng Wu, Shengliang Pu, Boya Zhao, Zijin Li, Qing Wang, and Qiong Ran
- Subjects
Atmospheric Science ,Channel (digital image) ,Computer science ,Remote sensing application ,Geophysics. Cosmic physics ,Feature extraction ,0211 other engineering and technologies ,Context (language use) ,02 engineering and technology ,010501 environmental sciences ,01 natural sciences ,Convolutional neural network ,remote sensing ,Computers in Earth Sciences ,TC1501-1800 ,021101 geological & geomatics engineering ,0105 earth and related environmental sciences ,Remote sensing ,QC801-809 ,object detection ,Object detection ,Visualization ,Ocean engineering ,Channel attention ,Feature (computer vision) ,lightweight convolutional neural network (CNN) ,multiscale context - Abstract
Object detection is a focal point in remote sensing applications. Remote sensing images typically contain a large number of small objects and a wide range of orientations across objects. This results in great challenges to small object detection approaches based on remote sensing images. Methods directly employ channel relations with equal weights to construct information features leads to inadequate feature representation in complex image small object detection tasks. Multiscale detection methods improve the speed and accuracy of detection, while small objects themselves contain limited information, and the features are easily lost following down-sampling. During the detection, the feature images are independent across scales, resulting in a discontinuity at the detection scale. In this article, we propose the multiscale context and enhanced channel attention (MSCCA) model. MSCCA employs PeleeNet as the backbone network. In particular, the feature image channel attention is enhanced and the multiscale context information is fused with multiscale detection methods to improve the characterization ability of the convolutional neural network. The proposed MSCCA method is evaluated on two real datasets. Results show that for 512 × 512 input images, MSCCA was able to achieve 80.4% and 94.4% mAP on the DOTA and NWPU VHR-10, respectively. Meanwhile, the model size of MSCCA is 21% smaller than that of its predecessor. MSCCA can be considered as a practical lightweight oriented object detection model in remote sensing images.
- Published
- 2021