LPR: learning point-level temporal action localization through re-training.

Authors :
Fang, Zhenying
Fan, Jianping
Yu, Jun
Source :
Multimedia Systems. Oct2023, Vol. 29 Issue 5, p2545-2562. 18p.
Publication Year :
2023

Abstract

Point-level temporal action localization (PTAL) aims to locate action instances in untrimmed videos using only one timestamp annotation per action instance. Existing methods adopt the localization-by-classification paradigm, locating action boundaries by thresholding the temporal class activation map (TCAM); these are known as TCAM-based methods. However, TCAM-based methods are limited by the gap between the classification and localization tasks, since the TCAM is generated by a classification network. To address this issue, we propose LPR, a re-training framework for the PTAL task. The framework consists of two stages: pseudo-label generation and re-training. In the pseudo-label generation stage, we propose a feature embedding module based on a transformer encoder to capture global context features, and we improve the quality of the pseudo-labels by leveraging point-level annotations. In the re-training stage, LPR uses these pseudo-labels as supervision to locate action instances with a temporal action localization network rather than by generating TCAMs. Furthermore, to alleviate the effects of label noise in the pseudo-labels, we propose a joint learning classification module (JLCM) in the re-training stage. This module contains two classification sub-modules that simultaneously predict action categories and are guided by a jointly determined clean set for network training. The proposed framework achieves state-of-the-art localization performance on both the THUMOS'14 and BEOID datasets. [ABSTRACT FROM AUTHOR]
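
The TCAM thresholding step that the abstract attributes to existing methods can be illustrated with a minimal sketch. This is not the authors' code: the function name `threshold_segments`, the toy score values, and the threshold of 0.5 are all hypothetical, and a real TCAM would hold one such score sequence per action class.

```python
from typing import List, Tuple


def threshold_segments(scores: List[float], threshold: float) -> List[Tuple[int, int]]:
    """Return [start, end) index pairs of maximal runs where score > threshold.

    `scores` stands in for a single-class activation sequence over time
    (an assumption made for illustration only).
    """
    segments: List[Tuple[int, int]] = []
    start = None  # index where the current above-threshold run began
    for t, s in enumerate(scores):
        if s > threshold and start is None:
            start = t  # a run begins
        elif s <= threshold and start is not None:
            segments.append((start, t))  # the run ended just before t
            start = None
    if start is not None:
        # The sequence ended while still above the threshold.
        segments.append((start, len(scores)))
    return segments


# Toy activation sequence (hypothetical values, one score per snippet).
tcam = [0.1, 0.2, 0.8, 0.9, 0.7, 0.1, 0.05, 0.6, 0.65, 0.2]
print(threshold_segments(tcam, 0.5))  # [(2, 5), (7, 9)]
```

Because the segment boundaries depend entirely on where the classification scores cross the threshold, this kind of procedure makes concrete the classification-to-localization gap that the abstract describes.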

Subjects

Subjects :
*LEARNING modules

Details

Language :
English
ISSN :
09424962
Volume :
29
Issue :
5
Database :
Academic Search Index
Journal :
Multimedia Systems
Publication Type :
Academic Journal
Accession number :
171993647
Full Text :
https://doi.org/10.1007/s00530-023-01128-4