Back to Search
Start Over
Mask Assisted Object Coding with Deep Learning for Object Retrieval in Surveillance Videos
- Source :
- ACM Multimedia
- Publication Year :
- 2014
- Publisher :
- ACM, 2014.
-
Abstract
- Retrieving visual object from a large-scale video dataset is one of multimedia research focuses but a challenging task due to imprecise object extraction and partial occlusion. This paper presents a novel approach to efficiently encode and retrieve visual objects, which addresses some practical complications in surveillance videos. Specifically, we take advantage of the mask information to assist object representation, and develop an encoding method by utilizing highly nonlinear mapping with a deep neural network. Furthermore, we add some occluded noise into the learning process to enhance the robustness of dealing with background noise and partial occlusions. A real-life surveillance video data containing over 10 million objects are built to evaluate the proposed approach. Experimental results show our approach significantly outperforms state-of-the-art solutions for object retrieval in large-scale video dataset.
Details
- Database :
- OpenAIRE
- Journal :
- Proceedings of the 22nd ACM international conference on Multimedia
- Accession number :
- edsair.doi.dedup.....6e2e0f1b5d42687dd5635ed9a2ed6287
- Full Text :
- https://doi.org/10.1145/2647868.2654981