Start Over

Action boundaries detection in a video

Authors :: Bassem Haidar
Hassan Wehbe
Philippe Joly
Centre National de la Recherche Scientifique - CNRS (FRANCE)
Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE)
Lebanese University - LU (LEBANON)
Université Toulouse III - Paul Sabatier - UT3 (FRANCE)
Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université Toulouse 1 Capitole - UT1 (FRANCE)
Équipe Structuration, Analyse et MOdélisation de documents Vidéo et Audio (IRIT-SAMoVA)
Institut de recherche en informatique de Toulouse (IRIT)
Université Toulouse 1 Capitole (UT1)
Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3)
Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP)
Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1)
Université Fédérale Toulouse Midi-Pyrénées
Lebanese International University (LIU)
Institut National Polytechnique de Toulouse - INPT (FRANCE)
Source :: Multimedia Tools and Applications, Multimedia Tools and Applications, Springer Verlag, 2015, vol. 75 (n° 14), pp. 8239-8266. ⟨10.1007/s11042-015-2748-5⟩
Publication Year :: 2015
Publisher :: Springer-Verlag, 2015.
Abstract: International audience; In the video analysis domain, automatic detection of actions performed in a recorded video represents an important scientific and industrial challenge. This paper presents a new method to approximate the boundaries of actions performed by a person while interacting with his environment (such as moving objects). This method relies on a Codebook quantization method to analyze the rough evolution of each pixel and then decide whether this evolution corresponds to an action or not; this decision is taken by an automated system. Statistics are then produced - at the scale of the whole frame - to estimate the start and the end of an action. According to our proposed evaluation protocol, this method produces interesting results on both real and simulated videos. This statistic-based protocol is discussed at the end of this paper. The interpretation of this evaluation protocol nominates this method to be a solid base to localize the exact boundaries of actions or - in the framework of this research activity - to associate prescriptive text with a visual content.

Subjects :: 0209 industrial biotechnology
Computer Networks and Communications
Computer science
02 engineering and technology
Video analysis
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
Codebook quantization
Traitement des images
020901 industrial engineering & automation
Segmentation
[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
0202 electrical engineering, electronic engineering, information engineering
Media Technology
Traitement du signal et de l'image
Computer vision
Action detection
Synthèse d'image et réalité virtuelle
Pixel
business.industry
Quantization (signal processing)
Codebook
[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]
Vision par ordinateur et reconnaissance de formes
Intelligence artificielle
[INFO.INFO-GR]Computer Science [cs]/Graphics [cs.GR]
Hardware and Architecture
[INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV]
020201 artificial intelligence & image processing
Artificial intelligence
business
Software

Details

Language :: English
ISSN :: 13807501 and 15737721
Database :: OpenAIRE
Journal :: Multimedia Tools and Applications, Multimedia Tools and Applications, Springer Verlag, 2015, vol. 75 (n° 14), pp. 8239-8266. ⟨10.1007/s11042-015-2748-5⟩
Accession number :: edsair.doi.dedup.....3f01af8df86adb9cbeafee058706bc79
Full Text :: https://doi.org/10.1007/s11042-015-2748-5⟩