1. Previewer for Multi-Scale Object Detector
- Author
-
Zhongming Jin, Guo-Jun Qi, Chen Shen, Zhihang Fu, Rongxin Jiang, Xian-Sheng Hua, and Yaowu Chen
- Subjects
Computer science ,business.industry ,Pattern recognition ,02 engineering and technology ,Pascal (programming language) ,010501 environmental sciences ,01 natural sciences ,Convolutional neural network ,Object detection ,Feature (computer vision) ,Face (geometry) ,0202 electrical engineering, electronic engineering, information engineering ,Benchmark (computing) ,False positive paradox ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,0105 earth and related environmental sciences ,computer.programming_language ,Block (data storage) - Abstract
Most multi-scale detectors face a challenge of small-size false positives due to the inadequacy of low-level features, which have small receptive field sizes and weak semantic capabilities. This paper demonstrates independent predictions from different feature layers on the same region is beneficial for reducing false positives. We propose a novel light-weight previewer block, which previews the objectness probability for the potential regression region of each prior box, using the stronger features with larger receptive fields and more contextual information for better predictions. This previewer block is generic and can be easily implemented in multi-scale detectors, such as SSD, RFBNet and MS-CNN. Extensive experiments are conducted on PASCAL VOC and KITTI pedestrian benchmark to show the superiority of the proposed method.
- Published
- 2018
- Full Text
- View/download PDF