Ventral & Dorsal Stream Theory based Zero-Shot Action Recognition. (August 2021)
- Record Type:
- Journal Article
- Title:
- Ventral & Dorsal Stream Theory based Zero-Shot Action Recognition. (August 2021)
- Main Title:
- Ventral & Dorsal Stream Theory based Zero-Shot Action Recognition
- Authors:
- Xing, Meng
Feng, Zhiyong
Su, Yong
Peng, Weilong
Zhang, Jianhai - Abstract:
- Highlights: A general joint-conditional-probability framework is proposed to explain the inference mechanism of ZSAR methods. As far as we know, we are the first to propose the probabilistic model for the inference mechanism of ZSAR methods. A novel nonlinear similarity metric learning mechanism is proposed to establish the nonlinear mapping relationship between the visual space and the semantic space. A Ventral & Dorsal Stream Theory based Zero-Shot Action Recognition approach (VD-ZSAR) is proposed and achieved favorable performance on three benchmark datasets. The Ventral & Dorsal Stream Theory is introduced to extract irredundant visual feature, which can relieve relation ambiguity caused by redundant visual feature. Abstract: Most Zero-Shot Action Recognition (ZSAR) methods establish visual-semantic joint embedding space, which is based on commonly used visual features and semantic embeddings, to learn the correlation between actions. Nevertheless, extracting visual features without structural guidance would lead to sparse video features, which reflect the correlation of actions, fall into oblivion. Based on the Ventral & Dorsal Stream Theory (VD), we propose a VD-ZSAR method to extract irredundant visual feature, which can relieve relation ambiguity caused by redundant visual feature. And a visual-semantic joint embedding space is learned by combining nonredundant visual space with semantic space. Specifically, visual space is constructed by the motion cues perceived byHighlights: A general joint-conditional-probability framework is proposed to explain the inference mechanism of ZSAR methods. As far as we know, we are the first to propose the probabilistic model for the inference mechanism of ZSAR methods. A novel nonlinear similarity metric learning mechanism is proposed to establish the nonlinear mapping relationship between the visual space and the semantic space. A Ventral & Dorsal Stream Theory based Zero-Shot Action Recognition approach (VD-ZSAR) is proposed and achieved favorable performance on three benchmark datasets. The Ventral & Dorsal Stream Theory is introduced to extract irredundant visual feature, which can relieve relation ambiguity caused by redundant visual feature. Abstract: Most Zero-Shot Action Recognition (ZSAR) methods establish visual-semantic joint embedding space, which is based on commonly used visual features and semantic embeddings, to learn the correlation between actions. Nevertheless, extracting visual features without structural guidance would lead to sparse video features, which reflect the correlation of actions, fall into oblivion. Based on the Ventral & Dorsal Stream Theory (VD), we propose a VD-ZSAR method to extract irredundant visual feature, which can relieve relation ambiguity caused by redundant visual feature. And a visual-semantic joint embedding space is learned by combining nonredundant visual space with semantic space. Specifically, visual space is constructed by the motion cues perceived by Dorsal Stream, and the object cues perceived by Ventral Stream. Semantic space is constructed by sentence-to-vector generator. The visual-semantic joint embedding space is built by a nonlinear similarity metric learning mechanism, which can better implicitly reflect the correlation between actions. Extensive experiments on the Olympic, HDMB51 and UCF101 datasets validate the favorable performance of our proposed approach. … (more)
- Is Part Of:
- Pattern recognition. Volume 116(2021)
- Journal:
- Pattern recognition
- Issue:
- Volume 116(2021)
- Issue Display:
- Volume 116, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 116
- Issue:
- 2021
- Issue Sort Value:
- 2021-0116-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-08
- Subjects:
- VD-ZSAR -- Ventral & Dorsal Stream Theory -- Nonlinear similarity metric learning mechanism
00-01 -- 99-00
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2021.107953 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 16889.xml