Text/non-text image classification in the wild with convolutional neural networks. (June 2017)
- Record Type:
- Journal Article
- Title:
- Text/non-text image classification in the wild with convolutional neural networks. (June 2017)
- Main Title:
- Text/non-text image classification in the wild with convolutional neural networks
- Authors:
- Bai, Xiang
Shi, Baoguang
Zhang, Chengquan
Cai, Xuan
Qi, Li - Abstract:
- Abstract: Text in natural images is an important source of information, which can be utilized for many real-world applications. This work focuses on a new problem: distinguishing images that contain text from a large volume of natural images. To address this problem, we propose a novel convolutional neural network variant, called multi-scale spatial partition network (MSP-Net). The network classifies images that contain text or not, by predicting text existence in all image blocks, which are spatial partitions at multiple scales on an input image. The whole image is classified as a text image (an image containing text) as long as one of the blocks is predicted to contain text. The network classifies images very efficiently by predicting all blocks simultaneously in a single forward propagation. Through experimental evaluations and comparisons on public datasets, we demonstrate the effectiveness and robustness of the proposed method. Abstract : Highlights: We study a new and important problem: text/non-text image classification in the wild. A new scheme based on block-level classification is proposed to tackle this problem. We propose MSP-Net, a novel CNN variant, to efficiently classify text/non-text images. As a by-product, MSP-Net outputs coarse locations and scales of texts.
- Is Part Of:
- Pattern recognition. Volume 66(2017:Jun.)
- Journal:
- Pattern recognition
- Issue:
- Volume 66(2017:Jun.)
- Issue Display:
- Volume 66 (2017)
- Year:
- 2017
- Volume:
- 66
- Issue Sort Value:
- 2017-0066-0000-0000
- Page Start:
- 437
- Page End:
- 446
- Publication Date:
- 2017-06
- Subjects:
- Natural images -- Text/non-text image classification -- Convolutional neural network -- Multi-scale spatial partition
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2016.12.005 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 1029.xml