Mask Assisted Object Coding with Deep Learning for Object Retrieval in Surveillance Videos

Authors :: Hanqing Lu
Min Xu
Jinqiao Wang
Kezhen Teng
Source :: ACM Multimedia
Publication Year :: 2014
Publisher :: ACM, 2014.
Abstract: Retrieving visual object from a large-scale video dataset is one of multimedia research focuses but a challenging task due to imprecise object extraction and partial occlusion. This paper presents a novel approach to efficiently encode and retrieve visual objects, which addresses some practical complications in surveillance videos. Specifically, we take advantage of the mask information to assist object representation, and develop an encoding method by utilizing highly nonlinear mapping with a deep neural network. Furthermore, we add some occluded noise into the learning process to enhance the robustness of dealing with background noise and partial occlusions. A real-life surveillance video data containing over 10 million objects are built to evaluate the proposed approach. Experimental results show our approach significantly outperforms state-of-the-art solutions for object retrieval in large-scale video dataset.

Subjects :: Artificial neural network
Computer science
business.industry
Deep learning
Autoencoder
Background noise
Robustness (computer science)
Video tracking
Visual Objects
Computer vision
Artificial intelligence
business
computer
Coding (social sciences)
computer.programming_language

Database :: OpenAIRE
Journal :: Proceedings of the 22nd ACM international conference on Multimedia
Accession number :: edsair.doi.dedup.....6e2e0f1b5d42687dd5635ed9a2ed6287
Full Text :: https://doi.org/10.1145/2647868.2654981

Full Text Access

Tools