Improving patch-based scene text script identification with ensembles of conjoined networks. (July 2017)
- Record Type:
- Journal Article
- Title:
- Improving patch-based scene text script identification with ensembles of conjoined networks. (July 2017)
- Main Title:
- Improving patch-based scene text script identification with ensembles of conjoined networks
- Authors:
- Gomez, Lluis
Nicolaou, Anguelos
Karatzas, Dimosthenis - Abstract:
- Highlights: We present a patch-based classification method for script identificattion in the wild. We describe a novel method based on the use of ensembles of conjoined networks (ECN). The ECN learns discriminative local features and their relative importance in a global classification rule. Our experiments demonstrate state-of-the-art results in three script identification datasets. Abstract: This paper focuses on the problem of script identification in scene text images. Facing this problem with state of the art CNN classifiers is not straightforward, as they fail to address a key characteristic of scene text instances: their extremely variable aspect ratio. Instead of resizing input images to a fixed aspect ratio as in the typical use of holistic CNN classifiers, we propose here a patch-based classification framework in order to preserve discriminative parts of the image that are characteristic of its class. We describe a novel method based on the use of ensembles of conjoined networks to jointly learn discriminative stroke-parts representations and their relative importance in a patch-based classification scheme. Our experiments with this learning procedure demonstrate state-of-the-art results in two public script identification datasets. In addition, we propose a new public benchmark dataset for the evaluation of multi-lingual scene text end-to-end reading systems. Experiments done in this dataset demonstrate the key role of script identification in a completeHighlights: We present a patch-based classification method for script identificattion in the wild. We describe a novel method based on the use of ensembles of conjoined networks (ECN). The ECN learns discriminative local features and their relative importance in a global classification rule. Our experiments demonstrate state-of-the-art results in three script identification datasets. Abstract: This paper focuses on the problem of script identification in scene text images. Facing this problem with state of the art CNN classifiers is not straightforward, as they fail to address a key characteristic of scene text instances: their extremely variable aspect ratio. Instead of resizing input images to a fixed aspect ratio as in the typical use of holistic CNN classifiers, we propose here a patch-based classification framework in order to preserve discriminative parts of the image that are characteristic of its class. We describe a novel method based on the use of ensembles of conjoined networks to jointly learn discriminative stroke-parts representations and their relative importance in a patch-based classification scheme. Our experiments with this learning procedure demonstrate state-of-the-art results in two public script identification datasets. In addition, we propose a new public benchmark dataset for the evaluation of multi-lingual scene text end-to-end reading systems. Experiments done in this dataset demonstrate the key role of script identification in a complete end-to-end system that combines our script identification method with a previously published text detector and an off-the-shelf OCR engine. … (more)
- Is Part Of:
- Pattern recognition. Volume 67(2017:Jul.)
- Journal:
- Pattern recognition
- Issue:
- Volume 67(2017:Jul.)
- Issue Display:
- Volume 67 (2017)
- Year:
- 2017
- Volume:
- 67
- Issue Sort Value:
- 2017-0067-0000-0000
- Page Start:
- 85
- Page End:
- 96
- Publication Date:
- 2017-07
- Subjects:
- Script identification -- Scene text understanding -- Multi-language OCR -- Convolutional neural networks -- Ensemble of conjoined networks
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2017.01.032 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 1166.xml