Back to Search Start Over

When Did It Happen? Duration-informed Temporal Localization of Narrated Actions in Vlogs.

Authors :
IGNAT, OANA
SANTIAGO CASTRO
YUHANG ZHOU
JIAJUN BAO
SHAN, DANDAN
MIHALCEA, RADA
Source :
ACM Transactions on Multimedia Computing, Communications & Applications; 2022 Suppl 3, Vol. 18, p1-18, 18p
Publication Year :
2022

Abstract

We consider the task of temporal human action localization in lifestyle vlogs. We introduce a novel dataset consisting of manual annotations of temporal localization for 13,000 narrated actions in 1,200 video clips. We present an extensive analysis of this data, which allows us to better understand how the language and visual modalities interact throughout the videos. We propose a simple yet effective method to localize the narrated actions based on their expected duration. Through several experiments and analyses, we show that our method brings complementary information with respect to previous methods, and leads to improvements over previous work for the task of temporal action localization. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
15516857
Volume :
18
Database :
Complementary Index
Journal :
ACM Transactions on Multimedia Computing, Communications & Applications
Publication Type :
Academic Journal
Accession number :
162602251
Full Text :
https://doi.org/10.1145/3495211