251. 基于时空域深度特征两级编码融合的视频分类.
- Author
-
智洪欣, 于洪涛, and 李邵梅
- Abstract
To solve the problem of low performance of deep features used for video classification, this paper proposed a novel method based on the fusion of video temporal and spatial information with two-level encoding method. Firstly, it used two convolutional neural networks (CNN) to respectively extract the video's spatial and temporal information. Then encoding the spatial and temporal information with Fisher vector (FV) and locally aggregating method respectively to get the effective representation of video. Finally, based on the two-level cascaded fusion feature, it used support vector machine (SVM) to classify the videos. Experimental results on UCF101 show that their method has a better performance contrasting to the state of the art methods. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF