Tree-based data augmentation and mutual learning for offline handwritten mathematical expression recognition. (December 2022)
- Record Type:
- Journal Article
- Title:
- Tree-based data augmentation and mutual learning for offline handwritten mathematical expression recognition. (December 2022)
- Main Title:
- Tree-based data augmentation and mutual learning for offline handwritten mathematical expression recognition
- Authors:
- Yang, Chen
Du, Jun
Zhang, Jianshu
Wu, Changjie
Chen, Mingjun
Wu, JiaJia - Abstract:
- Highlights: We propose a tree-based multi-level data augmentation strategy to effectively alleviate the problem of insufficient original annotation data, which is one of the critical technology to our champion system for the OffRaSHME20 competition. We introduce a novel tree-based mutual learning method to deeply integrate the string decoder and the tree decoder in both the training and inference stages, which fully complement the advantages of these two types of decoders. Our system significantly outperforms the other state-of-the-art results on both the OffRaSHME20 dataset and the CROHME14/16/19 datasets. Abstract: Recently, thanks to the successful application of the attention-based encoder-decoder framework, handwritten mathematical expression recognition (HMER) has achieved significant improvement. However, HMER is still a challenging task in the handwriting recognition area, which suffers from the ambiguity of handwritten symbols, the two-dimensional structure of mathematical expressions, and the lack of labeled data. In this paper, we attempt to improve the recognition performance and generalization ability of the existing state-of-the-art method from two perspectives: data augmentation and model design. We first propose a tree-based multi-level (including symbol level, sub-expression level, and image level) data augmentation strategy, which can generate many synthetic images. Then, we present a novel encoder-decoder hybrid model via tree-based mutual learning toHighlights: We propose a tree-based multi-level data augmentation strategy to effectively alleviate the problem of insufficient original annotation data, which is one of the critical technology to our champion system for the OffRaSHME20 competition. We introduce a novel tree-based mutual learning method to deeply integrate the string decoder and the tree decoder in both the training and inference stages, which fully complement the advantages of these two types of decoders. Our system significantly outperforms the other state-of-the-art results on both the OffRaSHME20 dataset and the CROHME14/16/19 datasets. Abstract: Recently, thanks to the successful application of the attention-based encoder-decoder framework, handwritten mathematical expression recognition (HMER) has achieved significant improvement. However, HMER is still a challenging task in the handwriting recognition area, which suffers from the ambiguity of handwritten symbols, the two-dimensional structure of mathematical expressions, and the lack of labeled data. In this paper, we attempt to improve the recognition performance and generalization ability of the existing state-of-the-art method from two perspectives: data augmentation and model design. We first propose a tree-based multi-level (including symbol level, sub-expression level, and image level) data augmentation strategy, which can generate many synthetic images. Then, we present a novel encoder-decoder hybrid model via tree-based mutual learning to fully utilize the complementarity between tree decoder and string decoder. Benefitting from our data augmentation strategy, we achieve 58.47%/57.82%/62.67% and 74.45% expression recognition accuracy respectively on the CROHME14/16/19 competition datasets and the OffRaSHME20 competition dataset. Moreover, tree-based data augmentation is a key technology to our champion system for the OffRaSHME20 competition. Our tree-based mutual learning method further improves the recognition accuracy to 61.63%/59.81%/64.38% and 75.68% on these datasets. Further quantitative and qualitative analyses also demonstrate the effectiveness and robustness of our proposed methods. … (more)
- Is Part Of:
- Pattern recognition. Volume 132(2022)
- Journal:
- Pattern recognition
- Issue:
- Volume 132(2022)
- Issue Display:
- Volume 132, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 132
- Issue:
- 2022
- Issue Sort Value:
- 2022-0132-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-12
- Subjects:
- Tree-based data augmentation -- Tree-based mutual learning -- Encoder-decoder -- Offline handwritten mathematical expression recognition
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2022.108910 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23281.xml