Zero-shot Handwritten Chinese Character Recognition with hierarchical decomposition embedding. (November 2020)
- Record Type:
- Journal Article
- Title:
- Zero-shot Handwritten Chinese Character Recognition with hierarchical decomposition embedding. (November 2020)
- Main Title:
- Zero-shot Handwritten Chinese Character Recognition with hierarchical decomposition embedding
- Authors:
- Cao, Zhong
Lu, Jiang
Cui, Sen
Zhang, Changshui - Abstract:
- Highlights: A novel hierarchical decomposing embedding (HDE) method is proposed. The framework HDE-Net is proposed for zero-shot learning. HDE-Net achieves state-of-the-art results on CASIA-HWDB, ICDAR, CTW datasets. Qualitative and quantitative analyses demonstrate the effectiveness of the proposed framework. Abstract: Handwritten Chinese Character Recognition (HCCR) is a challenging topic in the field of pattern recognition due to large-scale character vocabulary, complex hierarchical structure, various writing styles, and scarce training samples. In this paper, we explored the hierarchical knowledge of Chinese characters and presented a novel zero-shot HCCR method. First, we handled the relations between the characters and their primitives, such as radicals and structures, to obtain a tree layout of primitives. Then, we presented a novel zero-shot hierarchical decomposition embedding method to encode the tree layout into a semantic vector. Next, we devised a Convolutional Neural Network (CNN) based framework to learn both radicals and structures of characters via the semantic vector. As different Chinese characters share some common radicals and structures, our method is able to recognize new categories without any labeled samples from them. Moreover, our method is effective in both traditional HCCR and zero-shot HCCR tasks. It achieves competitive performance on the traditional experiment setting and significantly surpasses the state-of-the-art methods on the zero-shotHighlights: A novel hierarchical decomposing embedding (HDE) method is proposed. The framework HDE-Net is proposed for zero-shot learning. HDE-Net achieves state-of-the-art results on CASIA-HWDB, ICDAR, CTW datasets. Qualitative and quantitative analyses demonstrate the effectiveness of the proposed framework. Abstract: Handwritten Chinese Character Recognition (HCCR) is a challenging topic in the field of pattern recognition due to large-scale character vocabulary, complex hierarchical structure, various writing styles, and scarce training samples. In this paper, we explored the hierarchical knowledge of Chinese characters and presented a novel zero-shot HCCR method. First, we handled the relations between the characters and their primitives, such as radicals and structures, to obtain a tree layout of primitives. Then, we presented a novel zero-shot hierarchical decomposition embedding method to encode the tree layout into a semantic vector. Next, we devised a Convolutional Neural Network (CNN) based framework to learn both radicals and structures of characters via the semantic vector. As different Chinese characters share some common radicals and structures, our method is able to recognize new categories without any labeled samples from them. Moreover, our method is effective in both traditional HCCR and zero-shot HCCR tasks. It achieves competitive performance on the traditional experiment setting and significantly surpasses the state-of-the-art methods on the zero-shot experiment setting. … (more)
- Is Part Of:
- Pattern recognition. Volume 107(2020:Nov.)
- Journal:
- Pattern recognition
- Issue:
- Volume 107(2020:Nov.)
- Issue Display:
- Volume 107 (2020)
- Year:
- 2020
- Volume:
- 107
- Issue Sort Value:
- 2020-0107-0000-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-11
- Subjects:
- Chinese character recognition -- Radical analysis -- Zero-shot learning -- Label embedding
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2020.107488 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 19108.xml