Few-shot prototype alignment regularization network for document image layout segementation. (July 2021)
- Record Type:
- Journal Article
- Title:
- Few-shot prototype alignment regularization network for document image layout segementation. (July 2021)
- Main Title:
- Few-shot prototype alignment regularization network for document image layout segementation
- Authors:
- Li, Yujie
Zhang, Pengfei
Xu, Xing
Lai, Yi
Shen, Fumin
Chen, Lijiang
Gao, Pengxiang - Abstract:
- Highlights: A novel method for the document image layout analysis problem is proposed. It makes better use of the information of the support set by metric learning. It learns classification prototype within an embedding space. Prototype alignment regularization term between support and query sets is developed. Abstract: Despite the great performance in layout analysis tasks made by semantic segmentation, they usually need a large number of annotated images for training and are difficult to learn a new category which is absent in the training categories. Meta-learning and few-shot segmentation have been developed to solve the above two difficulties. In this paper, we propose a novel method dubbed Few-Shot Prototype Alignment Regularization Network (FS-PARN). The FS-PARN method is inspired by recent studies in both metric learning and few-shot segmentation, which just need a few annotated images to solve the above two difficulties. Our FS-PARN method can make better use of the information of the support set by metric learning and have a better effect on image segmentation. It learns classification prototype within an embedding space and then completes pixel classification by matching each pixel on the query image with the learned prototype. In addition to obtaining high-quality prototypes through metric learning methods, our FS-PARN method also introduces prototype alignment regularization between support and query sets to make segmentation better. Notably, our FS-PARN modelHighlights: A novel method for the document image layout analysis problem is proposed. It makes better use of the information of the support set by metric learning. It learns classification prototype within an embedding space. Prototype alignment regularization term between support and query sets is developed. Abstract: Despite the great performance in layout analysis tasks made by semantic segmentation, they usually need a large number of annotated images for training and are difficult to learn a new category which is absent in the training categories. Meta-learning and few-shot segmentation have been developed to solve the above two difficulties. In this paper, we propose a novel method dubbed Few-Shot Prototype Alignment Regularization Network (FS-PARN). The FS-PARN method is inspired by recent studies in both metric learning and few-shot segmentation, which just need a few annotated images to solve the above two difficulties. Our FS-PARN method can make better use of the information of the support set by metric learning and have a better effect on image segmentation. It learns classification prototype within an embedding space and then completes pixel classification by matching each pixel on the query image with the learned prototype. In addition to obtaining high-quality prototypes through metric learning methods, our FS-PARN method also introduces prototype alignment regularization between support and query sets to make segmentation better. Notably, our FS-PARN model achieves the mean-IoU score of 28.8% and 31.7% on the practical document image datasets, i.e. PASCAL-5i, DSSE-200, and Layout Analysis Dataset, for 1-shot and 5-shot settings respectively. … (more)
- Is Part Of:
- Pattern recognition. Volume 115(2021)
- Journal:
- Pattern recognition
- Issue:
- Volume 115(2021)
- Issue Display:
- Volume 115, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 115
- Issue:
- 2021
- Issue Sort Value:
- 2021-0115-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-07
- Subjects:
- Meta-learning -- Few-shot learning -- Metric learning -- Semantic segmentation
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2021.107882 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 17362.xml