Back to Search Start Over

Mask Assisted Object Coding with Deep Learning for Object Retrieval in Surveillance Videos

Authors :
Hanqing Lu
Min Xu
Jinqiao Wang
Kezhen Teng
Source :
ACM Multimedia
Publication Year :
2014
Publisher :
ACM, 2014.

Abstract

Retrieving visual object from a large-scale video dataset is one of multimedia research focuses but a challenging task due to imprecise object extraction and partial occlusion. This paper presents a novel approach to efficiently encode and retrieve visual objects, which addresses some practical complications in surveillance videos. Specifically, we take advantage of the mask information to assist object representation, and develop an encoding method by utilizing highly nonlinear mapping with a deep neural network. Furthermore, we add some occluded noise into the learning process to enhance the robustness of dealing with background noise and partial occlusions. A real-life surveillance video data containing over 10 million objects are built to evaluate the proposed approach. Experimental results show our approach significantly outperforms state-of-the-art solutions for object retrieval in large-scale video dataset.

Details

Database :
OpenAIRE
Journal :
Proceedings of the 22nd ACM international conference on Multimedia
Accession number :
edsair.doi.dedup.....6e2e0f1b5d42687dd5635ed9a2ed6287
Full Text :
https://doi.org/10.1145/2647868.2654981