Back to Search
Start Over
A part-based spatial and temporal aggregation method for dynamic scene recognition.
- Source :
-
Neural Computing & Applications . Jul2021, Vol. 33 Issue 13, p7353-7370. 18p. - Publication Year :
- 2021
-
Abstract
- Existing methods for dynamic scene recognition mostly use global features extracted from the entire video frame or a video segment. In this paper, a part-based method is proposed to aggregate local features from video frames. A pre-trained Fast R-CNN model is used to extract local convolutional features from the regions of interest of training images. These features are clustered to locate representative parts. A set cover problem is then formulated to select the discriminative parts, which are further refined by fine-tuning the Fast R-CNN model. Local features from a video segment are extracted at different layers of the fine-tuned Fast R-CNN model and aggregated both spatially and temporally. Extensive experimental results show that the proposed method is very competitive with state-of-the-art approaches. [ABSTRACT FROM AUTHOR]
- Subjects :
- *OBJECT recognition (Computer vision)
*VIDEOS
Subjects
Details
- Language :
- English
- ISSN :
- 09410643
- Volume :
- 33
- Issue :
- 13
- Database :
- Academic Search Index
- Journal :
- Neural Computing & Applications
- Publication Type :
- Academic Journal
- Accession number :
- 151065852
- Full Text :
- https://doi.org/10.1007/s00521-020-05415-3