Table Structure Recognition and Form Parsing by End-to-End Object Detection and Relation Parsing. (December 2022)
- Record Type:
- Journal Article
- Title:
- Table Structure Recognition and Form Parsing by End-to-End Object Detection and Relation Parsing. (December 2022)
- Main Title:
- Table Structure Recognition and Form Parsing by End-to-End Object Detection and Relation Parsing
- Authors:
- Li, Xiao-Hui
Yin, Fei
Dai, He-Sen
Liu, Cheng-Lin - Abstract:
- Highlights: A graph neural network (GNN) based unified framework named TSRNet is proposed to jointly detect and recognize the structures of various tables and forms. GNN is used to classify and group primitive regions into page objects and classify the relationships. The parameters of all the modules in the system is trained end-to-end to optimize the overall performance Superior performance has been achieved on table detection, table structure recognition and template-free form parsing. Abstract: The recognition of two-dimensional structure of tables and forms from document images is a challenge due to the complexity of document structures and the diversity of layouts. In this paper, we propose a graph neural network (GNN) based unified framework named Table Structure Recognition Network (TSRNet) to jointly detect and recognize the structures of various tables and forms. First, a multi-task fully convolutional network (FCN) is used to segment primitive regions such as text segments and ruling lines from document images, then a GNN is used to classify and group these primitive regions into page objects such as tables and cells. At last, the relationships between neighboring page objects are analyzed using another GNN based parsing module. The parameters of all the modules in the system can be trained end-to-end to optimize the overall performance. Experiments of table detection and structure recognition for modern documents on the POD 2017, cTDaR 2019 and PubTabNet datasetsHighlights: A graph neural network (GNN) based unified framework named TSRNet is proposed to jointly detect and recognize the structures of various tables and forms. GNN is used to classify and group primitive regions into page objects and classify the relationships. The parameters of all the modules in the system is trained end-to-end to optimize the overall performance Superior performance has been achieved on table detection, table structure recognition and template-free form parsing. Abstract: The recognition of two-dimensional structure of tables and forms from document images is a challenge due to the complexity of document structures and the diversity of layouts. In this paper, we propose a graph neural network (GNN) based unified framework named Table Structure Recognition Network (TSRNet) to jointly detect and recognize the structures of various tables and forms. First, a multi-task fully convolutional network (FCN) is used to segment primitive regions such as text segments and ruling lines from document images, then a GNN is used to classify and group these primitive regions into page objects such as tables and cells. At last, the relationships between neighboring page objects are analyzed using another GNN based parsing module. The parameters of all the modules in the system can be trained end-to-end to optimize the overall performance. Experiments of table detection and structure recognition for modern documents on the POD 2017, cTDaR 2019 and PubTabNet datasets and template-free form parsing for historical documents on the NAF dataset show that the proposed method can handle various table/form structures and achieve superior performance. … (more)
- Is Part Of:
- Pattern recognition. Volume 132(2022)
- Journal:
- Pattern recognition
- Issue:
- Volume 132(2022)
- Issue Display:
- Volume 132, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 132
- Issue:
- 2022
- Issue Sort Value:
- 2022-0132-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-12
- Subjects:
- Table detection -- Table structure recognition -- Template-free form parsing -- Graph neural network -- End-to-end training
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2022.108946 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23281.xml