A Machine Learning-Based Model to Evaluate Readability and Assess Grade Level for the Web Pages. (7th September 2020)
- Record Type:
- Journal Article
- Title:
- A Machine Learning-Based Model to Evaluate Readability and Assess Grade Level for the Web Pages. (7th September 2020)
- Main Title:
- A Machine Learning-Based Model to Evaluate Readability and Assess Grade Level for the Web Pages
- Authors:
- Pantula, Muralidhar
Kuppusamy, K S - Abstract:
- Abstract: Evaluating readability of web documents has gained attention due to several factors such as improving the effectiveness of writing and to reach a wider spectrum of audience. Current practices in this direction follow several statistical measures in evaluating readability of the document. In this paper, we have proposed a machine learning-based model to compute readability of web pages. The minimum educational standards required (grade level) to understand the contents of a web page are also computed. The proposed model classifies the web pages into highly readable, readable or less readable using specified feature set. To classify a web page with the aforementioned categories, we have incorporated the features such as sentence count, word count, syllable count, type-token ratio and lexical ambiguity. To increase the usability of the proposed model, we have developed an accessible browser extension to perform the assessments of every web page loaded into the browser.
- Is Part Of:
- Computer journal. Volume 65:Number 4(2022)
- Journal:
- Computer journal
- Issue:
- Volume 65:Number 4(2022)
- Issue Display:
- Volume 65, Issue 4 (2022)
- Year:
- 2022
- Volume:
- 65
- Issue:
- 4
- Issue Sort Value:
- 2022-0065-0004-0000
- Page Start:
- 831
- Page End:
- 842
- Publication Date:
- 2020-09-07
- Subjects:
- readability measure -- machine learning -- web page readability estimator -- browser extension
Computers -- Periodicals
005.1 - Journal URLs:
- http://comjnl.oxfordjournals.org/ ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/comjnl/bxaa113 ↗
- Languages:
- English
- ISSNs:
- 0010-4620
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.060000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21290.xml