Chart decoder: Generating textual and numeric information from chart images automatically. (October 2018)
- Record Type:
- Journal Article
- Title:
- Chart decoder: Generating textual and numeric information from chart images automatically. (October 2018)
- Main Title:
- Chart decoder: Generating textual and numeric information from chart images automatically
- Authors:
- Dai, Wenjing
Wang, Meng
Niu, Zhibin
Zhang, Jiawan
- Abstract:
- Charts are commonly used as a graphical representation for visualizing numerical data in digital documents. For many legacy charts or scientific charts, however, underlying data is not available, which hinders the process of redesigning more effective visualizations and further analysis of charts. In response, we present Chart Decoder, a system that implements decoding of visual features and recovers data from chart images. Chart Decoder takes a chart image as input and generates the textual and numeric information of that chart image as output through applying deep learning, computer vision and text recognition techniques. We train a deep learning based classifier to identify chart types of five categories (bar chart, pie chart, line chart, scatter plot and radar chart), which achieves a classification accuracy over 99%. We also complement a textual information extraction pipeline which detects text regions in a chart, recognizes text content and distinguishes their roles. For generating textual and graphical information, we implement automated data recovery from bar charts, one of the most popular chart types. To evaluate the effectiveness of our algorithms, we evaluate our system on two corpora: 1) bar charts collected from the web, 2) charts randomly made by a script. The results demonstrate that our system is able to recover data from bar charts with a high rate of accuracy.
- Is Part Of:
- Journal of visual languages & computing. Volume 48(2018)
- Journal:
- Journal of visual languages & computing
- Issue:
- Volume 48(2018)
- Issue Display:
- Volume 48, Issue 2018 (2018)
- Year:
- 2018
- Volume:
- 48
- Issue:
- 2018
- Issue Sort Value:
- 2018-0048-2018-0000
- Page Start:
- 101
- Page End:
- 109
- Publication Date:
- 2018-10
- Subjects:
- Visualization -- Computer vision -- Text recognition -- Information extraction
Visual programming languages (Computer science) -- Periodicals
Visual programming (Computer science) -- Periodicals
Programming languages (Electronic computers) -- Semantics -- Periodicals
Langages de programmation visuelle -- Périodiques
Programmation visuelle -- Périodiques
Langages de programmation -- Sémantique -- Périodiques
Programming languages (Electronic computers) -- Semantics
Visual programming (Computer science)
Visual programming languages (Computer science)
Periodicals
Electronic journals
005
- Journal URLs:
- http://www.sciencedirect.com/science/journal/1045926X
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.jvlc.2018.08.005
- Languages:
- English
- ISSNs:
- 1045-926X
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 5072.495200
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 23144.xml