Understanding Mean Score Differences Between the e‐rater® Automated Scoring Engine and Humans for Demographically Based Groups in the GRE® General Test. Issue 1 (27th April 2018)

Record Type:: Journal Article
Title:: Understanding Mean Score Differences Between the e‐rater® Automated Scoring Engine and Humans for Demographically Based Groups in the GRE® General Test. Issue 1 (27th April 2018)
Main Title:: Understanding Mean Score Differences Between the e‐rater® Automated Scoring Engine and Humans for Demographically Based Groups in the GRE® General Test
Authors:: Ramineni, Chaitanya
Williamson, David
Abstract:: Notable mean score differences for the e‐rater® automated scoring engine and for humans for essays from certain demographic groups were observed for the GRE® General Test in use before the major revision of 2012, called rGRE. The use of e‐rater as a check‐score model with discrepancy thresholds prevented an adverse impact on the examinee score at the item or test level. Despite this control, there remains a need to understand the root causes of these demographically based score differences and to identify potential mechanisms for avoiding future instances of discrepancy. In this study, we used a combination of statistical methods and human review to propose hypotheses about the root cause of score differences and whether such discrepancies reflect inadequacies of e‐rater, human scoring, or both. The human rating process was found to be influenced strongly by the scale structure and did not fully correspond to the e‐rater scoring mechanism. The human raters appeared to be using conditional logic and a rule‐based approach to their scoring, while e‐rater uses linear weighting of all the features. These analyses have implications for future research and operational policies for the scoring of the rGRE.
Report Number:: ETS RR–18‐12
Is Part Of:: ETS research report series. Issue 1 (2018)
Journal:: ETS research report series
Issue:: Issue 1 (2018)
Issue Display:: Volume 1, Issue 1 (2018)
Year:: 2018
Volume:: 1
Issue:: 1
Issue Sort Value:: 2018-0001-0001-0000
Page Start:: 1
Page End:: 31
Publication Date:: 2018-04-27
Subjects:: Automated scoring -- essay scoring -- GRE® writing -- subgroup differences -- shell text -- CART
Universities and colleges -- Entrance examinations
Universities and colleges -- Graduate work -- Examinations
Universities and colleges -- United States -- Entrance examinations
Universities and colleges -- United States -- Graduate work -- Examinations
Graduate Record Examination
Educational tests and measurements
Social sciences
Education -- Research
United States
378
Journal URLs:: http://www.ets.org/research/policy_research_reports/ets
http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)2330-8516
http://onlinelibrary.wiley.com/
DOI:: 10.1002/ets2.12192
Languages:: English
ISSNs:: 2330-8516
Deposit Type:: Legal deposit
View Content:: Available online (eLD content is only available in our Reading Rooms)
Physical Locations:: British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 17596.xml