Start Over

Video Classification With CNNs: Using the Codec as a Spatio-Temporal Activity Sensor.

Authors :: Chadha, Aaron
Abbas, Alhabib
Andreopoulos, Yiannis
Source :: IEEE Transactions on Circuits & Systems for Video Technology. Feb2019, Vol. 29 Issue 2, p475-485. 11p.
Publication Year :: 2019
Abstract: We investigate video classification via a two-stream convolutional neural network (CNN) design that directly ingests information extracted from compressed video bitstreams. Our approach begins with the observation that all modern video codecs divide the input frames into macroblocks (MBs). We demonstrate that selective access to MB motion vector (MV) information within compressed video bitstreams can also provide for selective, motion-adaptive, MB pixel decoding (a.k.a., MB texture decoding). This in turn allows for the derivation of spatio-temporal video activity regions at extremely high speed in comparison to conventional full-frame decoding followed by optical flow estimation. In order to evaluate the accuracy of a video classification framework based on such activity data, we independently train two CNN architectures on MB texture and MV correspondences and then fuse their scores to derive the final classification of each test video. Evaluation on two standard data sets shows that the proposed approach is competitive with the best two-stream video classification approaches found in the literature. At the same time: 1) a CPU-based realization of our MV extraction is over 977 times faster than GPU-based optical flow methods; 2) selective decoding is up to 12 times faster than full-frame decoding; and 3) our proposed spatial and temporal CNNs perform inference at 5 to 49 times lower cloud computing cost than the fastest methods from the literature. [ABSTRACT FROM AUTHOR]

Subjects :: *SPATIO-temporal variation
*CODECS
*IMAGE processing
*VIDEO compression
*STREAMING video & television

Details

Language :: English
ISSN :: 10518215
Volume :: 29
Issue :: 2
Database :: Academic Search Index
Journal :: IEEE Transactions on Circuits & Systems for Video Technology
Publication Type :: Academic Journal
Accession number :: 134602436
Full Text :: https://doi.org/10.1109/TCSVT.2017.2786999

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Video Classification With CNNs: Using the Codec as a Spatio-Temporal Activity Sensor.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Video Classification With CNNs: Using the Codec as a Spatio-Temporal Activity Sensor.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources