Could scene context be beneficial for scene text detection?. (October 2016)
- Record Type:
- Journal Article
- Title:
- Could scene context be beneficial for scene text detection?. (October 2016)
- Main Title:
- Could scene context be beneficial for scene text detection?
- Authors:
- Zhu, Anna
Gao, Renwu
Uchida, Seiichi - Abstract:
- Abstract: Scene text detection and scene segmentation are meaningful tasks in the computer vision field. Could the semantic scene segmentation assist scene text detection in any degree? For example, can we expect the probability of a region being text is low if its surrounding segment, i.e., its context, is labeled as sky? In this paper, we have a positive answer by constructing a scene context-based text detection model. In this model, we use texton features and a fully-connected conditional random field (CRF) to estimate pixel-level scene class׳s probability to be considered as image׳s context feature. Meanwhile, maximally stable extremal regions (MSERs) are extracted, integrated and extended as image patches of character candidates. Then, each image patch is fed to a simple two-layer convolutional neural network (CNN) to automatically extract its character feature. The averaged context feature of the corresponding patch is considered as the patch׳s context feature. The character feature and context feature are fused as the input into a support vector machine for text/non-text determination. Finally, as a post-processing, neighboring text regions are grouped hierarchically. The performance evaluation on ICDAR2013 and SVT databases, as well as a preliminary evaluation on a patch-level database, proves that the scene context can improve the performance of scene text detection. Moreover, the comparative study with state-of-the-art methods shows the top-level performance ofAbstract: Scene text detection and scene segmentation are meaningful tasks in the computer vision field. Could the semantic scene segmentation assist scene text detection in any degree? For example, can we expect the probability of a region being text is low if its surrounding segment, i.e., its context, is labeled as sky? In this paper, we have a positive answer by constructing a scene context-based text detection model. In this model, we use texton features and a fully-connected conditional random field (CRF) to estimate pixel-level scene class׳s probability to be considered as image׳s context feature. Meanwhile, maximally stable extremal regions (MSERs) are extracted, integrated and extended as image patches of character candidates. Then, each image patch is fed to a simple two-layer convolutional neural network (CNN) to automatically extract its character feature. The averaged context feature of the corresponding patch is considered as the patch׳s context feature. The character feature and context feature are fused as the input into a support vector machine for text/non-text determination. Finally, as a post-processing, neighboring text regions are grouped hierarchically. The performance evaluation on ICDAR2013 and SVT databases, as well as a preliminary evaluation on a patch-level database, proves that the scene context can improve the performance of scene text detection. Moreover, the comparative study with state-of-the-art methods shows the top-level performance of our method. Abstract : Highlights: It provides a new perspective for scene text detection. The semantic scene segmentation can assist scene text detection. The character feature and context feature are fused for text⧹non-text classification. Four variable controlled comparison experiments are tested on a patch-level dataset. … (more)
- Is Part Of:
- Pattern recognition. Volume 58(2016:Oct.)
- Journal:
- Pattern recognition
- Issue:
- Volume 58(2016:Oct.)
- Issue Display:
- Volume 58 (2016)
- Year:
- 2016
- Volume:
- 58
- Issue Sort Value:
- 2016-0058-0000-0000
- Page Start:
- 204
- Page End:
- 215
- Publication Date:
- 2016-10
- Subjects:
- Scene text detection -- Fully connected CRF -- Convolutional neural network -- Character feature -- Context feature
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2016.04.011 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 2200.xml