1. LARGER RECEPTIVE FIELD BASED RGB VISUAL RELOCALIZATION METHOD USING CONVOLUTIONAL NETWORK
- Author
-
Deren Li, Jiangying Qin, Ming Li, Hanqi Zhang, Xuan Liao, Jiageng Zhong, Paparoditis, Nicolas, Mallet, Clément, Lafarge, Florent, and et al.
- Subjects
Technology ,Computer science ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Visual relocalization ,Camera relocalization ,Pose regression ,Deep ConvNet ,RGB image ,Convolutional neural network ,Image (mathematics) ,Robustness (computer science) ,Applied optics. Photonics ,Computer vision ,Single image ,Pose ,business.industry ,Engineering (General). Civil engineering (General) ,TA1501-1820 ,Receptive field ,Key (cryptography) ,RGB color model ,Artificial intelligence ,TA1-2040 ,business - Abstract
Visual Relocalization is a key technology in many computer vision applications. Traditional visual relocalization is mainly achieved through geometric methods, while PoseNet introduces convolutional neural network in visual relocalization for the first time to realize real-time camera pose estimation based on a single image. Aiming at the problem of accuracy and robustness of the current PoseNet algorithm in complex environment, this paper proposes and implements a new high-precision robust camera pose calculation method (LRF-PoseNet). This method directly adjusts the size of the input image without cropping, so as to increase the receptive field of the training image. Then, the image and the corresponding pose tags are input into the improved LSTM-based PoseNet network for training, and the Adam optimizer is used to optimize the network. Finally, the trained network is used to estimate the camera pose. Experimental results on open RGB dataset show that the proposed method in this paper can obtain more accurate camera pose compared with the existing CNN-based methods., International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, XLIII-B2-2021, ISSN:1682-1750, ISSN:2194-9034, ISSN:1682-1777
- Published
- 2021