Two-stage generative adversarial networks for binarization of color document images. (October 2022)
- Record Type:
- Journal Article
- Title:
- Two-stage generative adversarial networks for binarization of color document images. (October 2022)
- Main Title:
- Two-stage generative adversarial networks for binarization of color document images
- Authors:
- Suh, Sungho
Kim, Jihun
Lukowicz, Paul
Lee, Yong Oh - Abstract:
- Highlights: We propose a two-stage color document image enhancement and binarization method using generative adversarial networks. Four color-independent adversarial networks extract color foreground information from an input image for document image enhancement. Two independent adversarial networks with global and local features are trained for image binarization of documents of variable size. The proposed method outperforms state-of-the-art algorithms on various datasets. Abstract: Document image enhancement and binarization methods are often used to improve the accuracy and efficiency of document image analysis tasks such as text recognition. Traditional non-machine-learning methods are constructed on low-level features in an unsupervised manner but have difficulty with binarization on documents with severely degraded backgrounds. Convolutional neural network (CNN)based methods focus only on grayscale images and on local textual features. In this paper, we propose a two-stage color document image enhancement and binarization method using generative adversarial neural networks. In the first stage, four color-independent adversarial networks are trained to extract color foreground information from an input image for document image enhancement. In the second stage, two independent adversarial networks with global and local features are trained for image binarization of documents of variable size. For the adversarial neural networks, we formulate loss functions between aHighlights: We propose a two-stage color document image enhancement and binarization method using generative adversarial networks. Four color-independent adversarial networks extract color foreground information from an input image for document image enhancement. Two independent adversarial networks with global and local features are trained for image binarization of documents of variable size. The proposed method outperforms state-of-the-art algorithms on various datasets. Abstract: Document image enhancement and binarization methods are often used to improve the accuracy and efficiency of document image analysis tasks such as text recognition. Traditional non-machine-learning methods are constructed on low-level features in an unsupervised manner but have difficulty with binarization on documents with severely degraded backgrounds. Convolutional neural network (CNN)based methods focus only on grayscale images and on local textual features. In this paper, we propose a two-stage color document image enhancement and binarization method using generative adversarial neural networks. In the first stage, four color-independent adversarial networks are trained to extract color foreground information from an input image for document image enhancement. In the second stage, two independent adversarial networks with global and local features are trained for image binarization of documents of variable size. For the adversarial neural networks, we formulate loss functions between a discriminator and generators having an encoder–decoder structure. Experimental results show that the proposed method achieves better performance than many classical and state-of-the-art algorithms over the Document Image Binarization Contest (DIBCO) datasets, the LRDE Document Binarization Dataset (LRDE DBD), and our shipping label image dataset. We plan to release the shipping label dataset as well as our implementation code at github.com/opensuh/DocumentBinarization/ . … (more)
- Is Part Of:
- Pattern recognition. Volume 130(2022)
- Journal:
- Pattern recognition
- Issue:
- Volume 130(2022)
- Issue Display:
- Volume 130, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 130
- Issue:
- 2022
- Issue Sort Value:
- 2022-0130-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-10
- Subjects:
- Document image binarization -- Generative adversarial networks -- Optical character recognition -- Color document image enhancement
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2022.108810 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 22236.xml