Semantic action recognition by learning a pose lexicon. (December 2017)
- Record Type:
- Journal Article
- Title:
- Semantic action recognition by learning a pose lexicon. (December 2017)
- Main Title:
- Semantic action recognition by learning a pose lexicon
- Authors:
- Zhou, Lijuan
Li, Wanqing
Ogunbona, Philip
Zhang, Zhengyou - Abstract:
- Highlights: A novel semantic representation, pose lexicon, is proposed for action recognition. An extended hidden Markov alignment model is developed to learn a pose lexicon. A semantic action recognition method that is capable of zero-shot recognition is developed upon the lexicon. The efficacy of the proposed learning and recognition algorithms were evaluated on five datasets using cross-subject, cross-dataset and zero-shot protocols. Abstract: This paper proposes a semantic representation, pose lexicon, for action recognition. The lexicon is composed of a set of semantic poses, a set of visual poses and a probabilistic mapping between the visual and semantic poses. Specially, an action can be represented by a sequence of semantic poses extracted from an associated textual instruction. Visual frames of the action are considered to be generated from a sequence of hidden visual poses. To learn the lexicon, a visual pose model is learned from training samples by a Gaussian Mixture model to characterize the likelihood of an observed visual frame being generated by a visual pose. A pose lexicon model is also learned by an extended hidden Markov alignment model to encode the probabilistic mapping between hidden visual poses and semantic poses sequences. With the lexicon, action classification is formulated as a problem of finding the maximum posterior probability of a given sequence of visual frames that fits to a given sequence of semantic poses through the most likely visualHighlights: A novel semantic representation, pose lexicon, is proposed for action recognition. An extended hidden Markov alignment model is developed to learn a pose lexicon. A semantic action recognition method that is capable of zero-shot recognition is developed upon the lexicon. The efficacy of the proposed learning and recognition algorithms were evaluated on five datasets using cross-subject, cross-dataset and zero-shot protocols. Abstract: This paper proposes a semantic representation, pose lexicon, for action recognition. The lexicon is composed of a set of semantic poses, a set of visual poses and a probabilistic mapping between the visual and semantic poses. Specially, an action can be represented by a sequence of semantic poses extracted from an associated textual instruction. Visual frames of the action are considered to be generated from a sequence of hidden visual poses. To learn the lexicon, a visual pose model is learned from training samples by a Gaussian Mixture model to characterize the likelihood of an observed visual frame being generated by a visual pose. A pose lexicon model is also learned by an extended hidden Markov alignment model to encode the probabilistic mapping between hidden visual poses and semantic poses sequences. With the lexicon, action classification is formulated as a problem of finding the maximum posterior probability of a given sequence of visual frames that fits to a given sequence of semantic poses through the most likely visual pose and alignment sequences. The efficacy of the proposed method was evaluated on MSRC-12, WorkoutSU-10, WorkoutUOW-18, Combined-15 and Combined-17 action datasets using cross-subject, cross-dataset and zero-shot protocols. … (more)
- Is Part Of:
- Pattern recognition. Volume 72(2017:Dec.)
- Journal:
- Pattern recognition
- Issue:
- Volume 72(2017:Dec.)
- Issue Display:
- Volume 72 (2017)
- Year:
- 2017
- Volume:
- 72
- Issue Sort Value:
- 2017-0072-0000-0000
- Page Start:
- 548
- Page End:
- 562
- Publication Date:
- 2017-12
- Subjects:
- Lexicon -- Semantic pose -- Visual pose -- Action recognition
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2017.06.035 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 4666.xml