A compact representation of human actions by sliding coordinate coding. (14th December 2017)
- Record Type:
- Journal Article
- Title:
- A compact representation of human actions by sliding coordinate coding. (14th December 2017)
- Main Title:
- A compact representation of human actions by sliding coordinate coding
- Authors:
- Ding, Runwei
Sun, Qianru
Liu, Mengyuan
Liu, Hong - Abstract:
- Human action recognition remains challenging in realistic videos, where scale and viewpoint changes make the problem complicated. Many complex models have been developed to overcome these difficulties, while we explore using low-level features and typical classifiers to achieve the state-of-the-art performance. The baseline model of feature encoding for action recognition is bag-of-words model, which has shown high efficiency but ignores the arrangement of local features. Refined methods compensate for this problem by using a large number of co-occurrence descriptors or a concatenation of the local distributions in designed segments. In contrast, this article proposes to encode the relative position of visual words using a simple but very compact method called sliding coordinates coding (SCC). The SCC vector of each kind of word is only an eight-dimensional vector which is more compact than many of the spatial or spatial–temporal pooling methods in the literature. Our key observation is that the relative position is robust to the variations of video scale and view angle. Additionally, we design a temporal cutting scheme to define the margin of coding within video clips, since visual words far away from each other have little relationship. In experiments, four action data sets, including KTH, Rochester Activities, IXMAS, and UCF YouTube, are used for performance evaluation. Results show that our method achieves comparable or better performance than the state of the art, whileHuman action recognition remains challenging in realistic videos, where scale and viewpoint changes make the problem complicated. Many complex models have been developed to overcome these difficulties, while we explore using low-level features and typical classifiers to achieve the state-of-the-art performance. The baseline model of feature encoding for action recognition is bag-of-words model, which has shown high efficiency but ignores the arrangement of local features. Refined methods compensate for this problem by using a large number of co-occurrence descriptors or a concatenation of the local distributions in designed segments. In contrast, this article proposes to encode the relative position of visual words using a simple but very compact method called sliding coordinates coding (SCC). The SCC vector of each kind of word is only an eight-dimensional vector which is more compact than many of the spatial or spatial–temporal pooling methods in the literature. Our key observation is that the relative position is robust to the variations of video scale and view angle. Additionally, we design a temporal cutting scheme to define the margin of coding within video clips, since visual words far away from each other have little relationship. In experiments, four action data sets, including KTH, Rochester Activities, IXMAS, and UCF YouTube, are used for performance evaluation. Results show that our method achieves comparable or better performance than the state of the art, while using more compact and less complex models. … (more)
- Is Part Of:
- International journal of advanced robotic systems. Volume 14:Number 6(2017:Nov./Dec.)
- Journal:
- International journal of advanced robotic systems
- Issue:
- Volume 14:Number 6(2017:Nov./Dec.)
- Issue Display:
- Volume 14, Issue 6 (2017)
- Year:
- 2017
- Volume:
- 14
- Issue:
- 6
- Issue Sort Value:
- 2017-0014-0006-0000
- Page Start:
- Page End:
- Publication Date:
- 2017-12-14
- Subjects:
- Human action recognition -- bag-of-words model -- local feature
Robotics -- Periodicals
Robotics
Periodicals
629.892 - Journal URLs:
- http://arx.sagepub.com/ ↗
http://search.epnet.com/direct.asp?db=bch&jid=13CR&scope=site ↗
http://www.intechweb.org/journal.php?id=3 ↗
http://www.uk.sagepub.com/home.nav ↗ - DOI:
- 10.1177/1729881417746114 ↗
- Languages:
- English
- ISSNs:
- 1729-8806
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 8192.xml