Open-vocabulary recognition of machine-printed Arabic text using hidden Markov models. (March 2016)
- Record Type:
- Journal Article
- Title:
- Open-vocabulary recognition of machine-printed Arabic text using hidden Markov models. (March 2016)
- Main Title:
- Open-vocabulary recognition of machine-printed Arabic text using hidden Markov models
- Authors:
- Ahmad, Irfan
Mahmoud, Sabri A.
Fink, Gernot A. - Abstract:
- Abstract: In this paper, we present multi-font printed Arabic text recognition using hidden Markov models (HMMs). We propose a novel approach to the sliding window technique for feature extraction. The size and position of the cells of the sliding window adapt to the writing line of Arabic text and ink-pixel distributions. We employ a two-step approach for mixed-font text recognition, in which the input text line image is associated with the closest known font in the first step, using simple and effective features for font identification. The text line is subsequently recognized by the recognizer that was trained for the particular font in the next step. This approach proves to be more effective than text recognition using a recognizer trained on samples from multiple fonts. We also present a framework for the recognition of unseen fonts, which employs font association and HMM adaptation techniques. Experiments were conducted using two separate databases of printed Arabic text to demonstrate the effectiveness of the presented techniques. The presented techniques can be easily adapted to other scripts, such as Roman script. Highlights: A novel approach to the sliding window technique for feature extraction. A two-step approach to mixed-font and unseen font text recognition. Simple and effective features for font identification. A multi-font printed Arabic text database for text recognition research. Experiments were conducted using two separate databases of printed ArabicAbstract: In this paper, we present multi-font printed Arabic text recognition using hidden Markov models (HMMs). We propose a novel approach to the sliding window technique for feature extraction. The size and position of the cells of the sliding window adapt to the writing line of Arabic text and ink-pixel distributions. We employ a two-step approach for mixed-font text recognition, in which the input text line image is associated with the closest known font in the first step, using simple and effective features for font identification. The text line is subsequently recognized by the recognizer that was trained for the particular font in the next step. This approach proves to be more effective than text recognition using a recognizer trained on samples from multiple fonts. We also present a framework for the recognition of unseen fonts, which employs font association and HMM adaptation techniques. Experiments were conducted using two separate databases of printed Arabic text to demonstrate the effectiveness of the presented techniques. The presented techniques can be easily adapted to other scripts, such as Roman script. Highlights: A novel approach to the sliding window technique for feature extraction. A two-step approach to mixed-font and unseen font text recognition. Simple and effective features for font identification. A multi-font printed Arabic text database for text recognition research. Experiments were conducted using two separate databases of printed Arabic text. … (more)
- Is Part Of:
- Pattern recognition. Volume 51(2016:Mar.)
- Journal:
- Pattern recognition
- Issue:
- Volume 51(2016:Mar.)
- Issue Display:
- Volume 51 (2016)
- Year:
- 2016
- Volume:
- 51
- Issue Sort Value:
- 2016-0051-0000-0000
- Page Start:
- 97
- Page End:
- 111
- Publication Date:
- 2016-03
- Subjects:
- Optical character recognition -- Mixed-font OCR -- Unseen-font OCR -- Hidden Markov models -- Font identification -- Sliding window -- Arabic OCR
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2015.09.011 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 59.xml