Applying Big Data visualization to detect trends in 30 years of performance reports. (October 2020)
- Record Type:
- Journal Article
- Title:
- Applying Big Data visualization to detect trends in 30 years of performance reports. (October 2020)
- Main Title:
- Applying Big Data visualization to detect trends in 30 years of performance reports
- Authors:
- Raveh, Eran
Ofek, Yuval
Bekkerman, Ron
Cohen, Hertzel - Abstract:
- Evaluators worldwide are dealing with a growing amount of unstructured electronic data, predominantly in textual format. Currently, evaluators analyze textual Big Data primarily using traditional content analysis methods based on keyword search, a practice that is limited to iterating over predefined concepts. But what if evaluators cannot define the necessary keywords for their analysis? Often we should examine trends in the way certain organizations have been operating, while our raw data are gigabytes of documents generated by that organization over decades. The problem is that in many cases we do not know what exactly we need to look for. In such cases, traditional analytical machinery would not provide an adequate solution within reasonable time—instead, heavy-lifting Big Data Science should be applied. We propose an automated, quantitative, user-friendly methodology based on text mining, machine learning, and data visualization, which assists researchers and evaluation practitioners to reveal trends, trajectories, and interrelations between bits and pieces of textual information in order to support evaluation. Our system automatically extracts a large amount of descriptive terminology for a particular domain in a given language, finds semantic connections between documents based on the extracted terminology, visualizes the entire document repository as a graph of semantic connections, and leads the user to the areas on that graph where most interesting trends can beEvaluators worldwide are dealing with a growing amount of unstructured electronic data, predominantly in textual format. Currently, evaluators analyze textual Big Data primarily using traditional content analysis methods based on keyword search, a practice that is limited to iterating over predefined concepts. But what if evaluators cannot define the necessary keywords for their analysis? Often we should examine trends in the way certain organizations have been operating, while our raw data are gigabytes of documents generated by that organization over decades. The problem is that in many cases we do not know what exactly we need to look for. In such cases, traditional analytical machinery would not provide an adequate solution within reasonable time—instead, heavy-lifting Big Data Science should be applied. We propose an automated, quantitative, user-friendly methodology based on text mining, machine learning, and data visualization, which assists researchers and evaluation practitioners to reveal trends, trajectories, and interrelations between bits and pieces of textual information in order to support evaluation. Our system automatically extracts a large amount of descriptive terminology for a particular domain in a given language, finds semantic connections between documents based on the extracted terminology, visualizes the entire document repository as a graph of semantic connections, and leads the user to the areas on that graph where most interesting trends can be observed. This article exemplifies the new method on 1700 performance reports, showing that the method can be used successfully, supplying evaluators with highly important information which cannot be revealed using other methods. Such exploratory exercise is vital as a preliminary exploratory phase for evaluations involving unstructured Big Data, after which a range of evaluation methods can be applied. We argue that our system can be successfully implemented on any domain evaluated. … (more)
- Is Part Of:
- Evaluation. Volume 26:Number 4(2020:Oct.)
- Journal:
- Evaluation
- Issue:
- Volume 26:Number 4(2020:Oct.)
- Issue Display:
- Volume 26, Issue 4 (2020)
- Year:
- 2020
- Volume:
- 26
- Issue:
- 4
- Issue Sort Value:
- 2020-0026-0004-0000
- Page Start:
- 516
- Page End:
- 540
- Publication Date:
- 2020-10
- Subjects:
- Big Data -- evaluation -- machine learning -- trends -- unstructured data -- visualization
Evaluation research (Social action programs) -- Periodicals
361.61068 - Journal URLs:
- http://evi.sagepub.com/ ↗
http://oxfordsfx-direct.hosted.exlibrisgroup.com/oxford?url%5Fver=Z39.88-2004&ctx%5Fver=Z39.88-2004&ctx%5Fenc=info:ofi/enc:UTF-8&rfr%5Fid=info:sid/sfxit.com:opac%5F856&url%5Fctx%5Ffmt=info:ofi/fmt:kev:mtx:ctx&sfx.ignore%5Fdate%5Fthreshold=1&rft.object%5Fid=960238624255&svc%5Fval%5Ffmt=info:ofi/fmt:kev:mtx:sch%5Fsvc& ↗
http://www.uk.sagepub.com/home.nav ↗ - DOI:
- 10.1177/1356389020905322 ↗
- Languages:
- English
- ISSNs:
- 1356-3890
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 14057.xml