Back to Search Start Over

Action boundaries detection in a video

Authors :
Bassem Haidar
Hassan Wehbe
Philippe Joly
Centre National de la Recherche Scientifique - CNRS (FRANCE)
Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE)
Lebanese University - LU (LEBANON)
Université Toulouse III - Paul Sabatier - UT3 (FRANCE)
Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université Toulouse 1 Capitole - UT1 (FRANCE)
Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio (IRIT-SAMoVA)
Institut de recherche en informatique de Toulouse (IRIT)
Université Toulouse 1 Capitole (UT1)
Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3)
Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP)
Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1)
Université Fédérale Toulouse Midi-Pyrénées
Lebanese International University (LIU)
Institut National Polytechnique de Toulouse - INPT (FRANCE)
Source :
Multimedia Tools and Applications, Multimedia Tools and Applications, Springer Verlag, 2015, vol. 75 (n° 14), pp. 8239-8266. ⟨10.1007/s11042-015-2748-5⟩
Publication Year :
2015
Publisher :
Springer-Verlag, 2015.

Abstract

International audience; In the video analysis domain, automatic detection of actions performed in a recorded video represents an important scientific and industrial challenge. This paper presents a new method to approximate the boundaries of actions performed by a person while interacting with his environment (such as moving objects). This method relies on a Codebook quantization method to analyze the rough evolution of each pixel and then decide whether this evolution corresponds to an action or not; this decision is taken by an automated system. Statistics are then produced - at the scale of the whole frame - to estimate the start and the end of an action. According to our proposed evaluation protocol, this method produces interesting results on both real and simulated videos. This statistic-based protocol is discussed at the end of this paper. The interpretation of this evaluation protocol nominates this method to be a solid base to localize the exact boundaries of actions or - in the framework of this research activity - to associate prescriptive text with a visual content.

Details

Language :
English
ISSN :
13807501 and 15737721
Database :
OpenAIRE
Journal :
Multimedia Tools and Applications, Multimedia Tools and Applications, Springer Verlag, 2015, vol. 75 (n° 14), pp. 8239-8266. ⟨10.1007/s11042-015-2748-5⟩
Accession number :
edsair.doi.dedup.....3f01af8df86adb9cbeafee058706bc79
Full Text :
https://doi.org/10.1007/s11042-015-2748-5⟩