Back to Search
Start Over
Semantic action recognition by learning a pose lexicon
- Source :
- Pattern Recognition. 72:548-562
- Publication Year :
- 2017
- Publisher :
- Elsevier BV, 2017.
-
Abstract
- This paper proposes a semantic representation, pose lexicon , for action recognition. The lexicon is composed of a set of semantic poses, a set of visual poses and a probabilistic mapping between the visual and semantic poses. Specially, an action can be represented by a sequence of semantic poses extracted from an associated textual instruction. Visual frames of the action are considered to be generated from a sequence of hidden visual poses. To learn the lexicon, a visual pose model is learned from training samples by a Gaussian Mixture model to characterize the likelihood of an observed visual frame being generated by a visual pose. A pose lexicon model is also learned by an extended hidden Markov alignment model to encode the probabilistic mapping between hidden visual poses and semantic poses sequences. With the lexicon, action classification is formulated as a problem of finding the maximum posterior probability of a given sequence of visual frames that fits to a given sequence of semantic poses through the most likely visual pose and alignment sequences. The efficacy of the proposed method was evaluated on MSRC-12, WorkoutSU-10, WorkoutUOW-18, Combined-15 and Combined-17 action datasets using cross-subject, cross-dataset and zero-shot protocols.
- Subjects :
- Sequence
Computer science
business.industry
Posterior probability
020207 software engineering
02 engineering and technology
Mixture model
Lexicon
computer.software_genre
Action (philosophy)
Artificial Intelligence
Signal Processing
0202 electrical engineering, electronic engineering, information engineering
Frame (artificial intelligence)
020201 artificial intelligence & image processing
Computer Vision and Pattern Recognition
Artificial intelligence
Set (psychology)
Hidden Markov model
business
computer
Software
Natural language processing
Subjects
Details
- ISSN :
- 00313203
- Volume :
- 72
- Database :
- OpenAIRE
- Journal :
- Pattern Recognition
- Accession number :
- edsair.doi...........8d17e6382c944a63d86d3e0ff688bd73