Automatic tracing and extraction of text‐line and word segments directly in JPEG compressed document images. Issue 9 (10th June 2020)
- Record Type:
- Journal Article
- Title:
- Automatic tracing and extraction of text‐line and word segments directly in JPEG compressed document images. Issue 9 (10th June 2020)
- Main Title:
- Automatic tracing and extraction of text‐line and word segments directly in JPEG compressed document images
- Authors:
- Rajesh, Bulla
Javed, Mohammed
Nagabhushan, P.
- Abstract:
- JPEG is one of the most popular and efficient compression algorithms supported in the consumer electronics world. The heavy use of mobile phones and e‐governance applications has resulted in a huge collection of JPEG compressed document images. The major challenge with these images is that processing them is expensive, since it requires repeated decompression and recompression operations. Recent work has shown that developing algorithms that operate directly on the compressed data is one way to overcome this issue. This research study investigates a novel algorithm for segmenting text‐lines and words directly from JPEG compressed handwritten document images. Segmenting a handwritten document is challenging due to uneven spacing, variable font sizes, and overlapping and touching components, and it becomes much more challenging when done directly in the compressed image. The proposed technique virtually fixes a vertical stripe at the beginning of the document to detect the starting points of text‐lines. A moving window‐based space penetration algorithm then traces the exact boundary between two text‐lines, resolving the issues of space and font variations and of touching and overlapping components. Subsequently, a word boundary tracing algorithm is used to segment words.
- Is Part Of:
- IET image processing. Volume 14:Issue 9(2020)
- Journal:
- IET image processing
- Issue:
- Volume 14:Issue 9(2020)
- Issue Display:
- Volume 14, Issue 9 (2020)
- Year:
- 2020
- Volume:
- 14
- Issue:
- 9
- Issue Sort Value:
- 2020-0014-0009-0000
- Page Start:
- 1909
- Page End:
- 1919
- Publication Date:
- 2020-06-10
- Subjects:
- digital libraries -- image coding -- image segmentation -- feature extraction -- document image processing -- data compression -- text detection
text‐line -- word segments -- JPEG -- Joint Photographic Experts Group -- popular compression algorithms -- efficient compression algorithms -- consumer electronics world -- excessive usage -- mobile phones -- compressed image -- decompression -- recompression operations -- compressed data -- handwritten document images -- overlapping components -- touching components -- moving window‐based space penetration algorithm -- exact line boundary -- word boundary tracing algorithm -- segment words -- spatial domains -- compressed domains
Image processing -- Periodicals
621.36705
- Journal URLs:
- http://digital-library.theiet.org/content/journals/iet-ipr
http://ieeexplore.ieee.org/servlet/opac?punumber=4149689
http://www.ietdl.org/IET-IPR
https://ietresearch.onlinelibrary.wiley.com/journal/17519667
http://www.theiet.org/
- DOI:
- 10.1049/iet-ipr.2019.1437
- Languages:
- English
- ISSNs:
- 1751-9659
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 4363.252600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 16599.xml