Back to Search Start Over

TagBook: A Semantic Video Representation Without Supervision for Event Detection

Authors :
Masoud Mazloom
Cees G. M. Snoek
Xirong Li
Intelligent Sensory Information Systems (IVI, FNWI)
Source :
IEEE Transactions on Multimedia, 18(7), 1378-1388. Institute of Electrical and Electronics Engineers Inc.
Publication Year :
2016
Publisher :
Institute of Electrical and Electronics Engineers (IEEE), 2016.

Abstract

We consider the problem of event detection in video for scenarios where only few, or even zero examples are available for training. For this challenging setting, the prevailing solutions in the literature rely on a semantic video representation obtained from thousands of pre-trained concept detectors. Different from existing work, we propose a new semantic video representation that is based on freely available social tagged videos only, without the need for training any intermediate concept detectors. We introduce a simple algorithm that propagates tags from a video's nearest neighbors, similar in spirit to the ones used for image retrieval, but redesign it for video event detection by including video source set refinement and varying the video tag assignment. We call our approach TagBook and study its construction, descriptiveness and detection performance on the TRECVID 2013 and 2014 multimedia event detection datasets and the Columbia Consumer Video dataset. Despite its simple nature, the proposed TagBook video representation is remarkably effective for few-example and zero-example event detection, even outperforming very recent state-of-the-art alternatives building on supervised representations.<br />Comment: accepted for publication as a regular paper in the IEEE Transactions on Multimedia

Details

ISSN :
19410077 and 15209210
Volume :
18
Database :
OpenAIRE
Journal :
IEEE Transactions on Multimedia
Accession number :
edsair.doi.dedup.....efd184b94c88b48b2f8c0ea83fccae86