1. FCDS-DETR: detection transformer based on feature correction and double sampling.
- Author
-
Wang, Min, Jiao, Zhiqiang, Huang, Zhanhua, and Yu, Shihang
- Subjects
- *
BIAS correction (Topology) , *PROBLEM solving , *POINT set theory - Abstract
The recently proposed semantic-aligned matching detection transformer (SAM–DETR model) accelerates the convergence of the detection transformer (DETR) by mapping object queries into an identical embedding space as the encoder's output feature map. However, SAM–DETR model has the problem of low detection accuracy compared to other DETR variants. We observe that the lower detection accuracy of SAM–DETR model is caused by the insufficient number of sample points and the inaccurate localization of the sample points during re-sampling, which blurs the generated attention map. This paper proposes an object detector based on a feature correction and double sampling DETR (FCDS-DETR) to solve this problem. FCDS-DETR takes SAM–DETR model as a baseline and builds on it by adding a feature correction module and a double sampling mechanism to achieve further improvement in detection accuracy with a limited number of additional parameters without sacrificing convergence speed. Firstly, FCDS-DETR improves the sampling point localization accuracy by adding a feature correction module to model the inter-channel dependence of the feature maps to be sampled. Secondly, the number of sampled points is increased by the double sampling mechanism, and attention fusion is used to fuse the attention weight maps corresponding to the two sets of sampled points to improve the recognizability of the attention weight maps. The experimental results show that the average precision is improved by +0.7 on the COCO dataset compared with the SAM–DETR model, and the number of parameters is increased by only 10.34 % , which improves the detection performance of the model very well. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF