A systematic review of methods for evaluating rating quality in language assessment. (April 2018)
- Record Type:
- Journal Article
- Title:
- A systematic review of methods for evaluating rating quality in language assessment. (April 2018)
- Main Title:
- A systematic review of methods for evaluating rating quality in language assessment
- Authors:
- Wind, Stefanie A.
- Peterson, Meghan E.
- Abstract:
- The use of assessments that require rater judgment (i.e., rater-mediated assessments) has become increasingly popular in high-stakes language assessments worldwide. Using a systematic literature review, the purpose of this study is to identify and explore the dominant methods for evaluating rating quality within the context of research on large-scale rater-mediated language assessments. Results from the review of 259 methodological and applied studies reveal an emphasis on inter-rater reliability as evidence of rating quality that persists across methodological and applied studies, studies primarily focused on rating quality and studies not primarily focused on rating quality, and across multiple language constructs. Additional findings suggest discrepancies in rating designs used in empirical research and practical concerns in performance assessment systems. Taken together, the findings from this study highlight the reliance upon aggregate-level information that is not specific to individual raters or specific facets of an assessment context as evidence of rating quality in rater-mediated assessments. In order to inform the interpretation and use of ratings, as well as the improvement of rater-mediated assessment systems, rating quality indices are needed that go beyond group-level indicators of inter-rater reliability, and provide diagnostic evidence of rating quality specific to individual raters, students, and other facets of the assessment system. These indicators are available based on modern measurement techniques, such as Rasch measurement theory and other item response theory approaches. Implications are discussed as they relate to validity, reliability/precision, and fairness for rater-mediated assessments.
- Is Part Of:
- Language testing. Volume 35:Number 2(2018)
- Journal:
- Language testing
- Issue:
- Volume 35:Number 2(2018)
- Issue Display:
- Volume 35, Issue 2 (2018)
- Year:
- 2018
- Volume:
- 35
- Issue:
- 2
- Issue Sort Value:
- 2018-0035-0002-0000
- Page Start:
- 161
- Page End:
- 192
- Publication Date:
- 2018-04
- Subjects:
- Language assessment -- rater effects -- rater-mediated assessment -- rating quality -- raters
- Language and languages -- Ability testing -- Periodicals
- Language and languages -- Examinations -- Periodicals
- 407.6
- Journal URLs:
- http://ltj.sagepub.com
- http://www.uk.sagepub.com/home.nav
- DOI:
- 10.1177/0265532216686999
- Languages:
- English
- ISSNs:
- 0265-5322
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
- British Library HMNTS - ELD Digital store
- Ingest File:
- 8063.xml