A systematic review of validity evidence for checklists versus global rating scales in simulation‐based assessment. Issue 2 (February 2015)
- Record Type:
- Journal Article
- Title:
- A systematic review of validity evidence for checklists versus global rating scales in simulation‐based assessment. Issue 2 (February 2015)
- Main Title:
- A systematic review of validity evidence for checklists versus global rating scales in simulation‐based assessment
- Authors:
- Ilgen, Jonathan S
Ma, Irene W Y
Hatala, Rose
Cook, David A - Abstract:
- <abstract abstract-type="main" id="medu12621-abs-0001"> <title> <x xml:space="preserve">Abstract</x> </title> <sec id="medu12621-sec-0001" sec-type="section"> <title>Context</title> <p>The relative advantages and disadvantages of checklists and global rating scales (GRSs) have long been debated. To compare the merits of these scale types, we conducted a systematic review of the validity evidence for checklists and GRSs in the context of simulation‐based assessment of health professionals.</p> </sec> <sec id="medu12621-sec-0002" sec-type="section"> <title>Methods</title> <p>We conducted a systematic review of multiple databases including MEDLINE, EMBASE and Scopus to February 2013. We selected studies that used both a GRS and checklist in the simulation‐based assessment of health professionals. Reviewers working in duplicate evaluated five domains of validity evidence, including correlation between scales and reliability. We collected information about raters, instrument characteristics, assessment context, and task. We pooled reliability and correlation coefficients using random‐effects meta‐analysis.</p> </sec> <sec id="medu12621-sec-0003" sec-type="section"> <title>Results</title> <p>We found 45 studies that used a checklist and GRS in simulation‐based assessment. All studies included physicians or physicians in training; one study also included nurse anaesthetists. Topics of assessment included open and laparoscopic surgery (<italic>n</italic> = 22), endoscopy<abstract abstract-type="main" id="medu12621-abs-0001"> <title> <x xml:space="preserve">Abstract</x> </title> <sec id="medu12621-sec-0001" sec-type="section"> <title>Context</title> <p>The relative advantages and disadvantages of checklists and global rating scales (GRSs) have long been debated. To compare the merits of these scale types, we conducted a systematic review of the validity evidence for checklists and GRSs in the context of simulation‐based assessment of health professionals.</p> </sec> <sec id="medu12621-sec-0002" sec-type="section"> <title>Methods</title> <p>We conducted a systematic review of multiple databases including MEDLINE, EMBASE and Scopus to February 2013. We selected studies that used both a GRS and checklist in the simulation‐based assessment of health professionals. Reviewers working in duplicate evaluated five domains of validity evidence, including correlation between scales and reliability. We collected information about raters, instrument characteristics, assessment context, and task. We pooled reliability and correlation coefficients using random‐effects meta‐analysis.</p> </sec> <sec id="medu12621-sec-0003" sec-type="section"> <title>Results</title> <p>We found 45 studies that used a checklist and GRS in simulation‐based assessment. All studies included physicians or physicians in training; one study also included nurse anaesthetists. Topics of assessment included open and laparoscopic surgery (<italic>n</italic> = 22), endoscopy (<italic>n</italic> = 8), resuscitation (<italic>n</italic> = 7) and anaesthesiology (<italic>n</italic> = 4). The pooled GRS–checklist correlation was 0.76 (95% confidence interval [CI] 0.69–0.81, <italic>n</italic> = 16 studies). Inter‐rater reliability was similar between scales (GRS 0.78, 95% CI 0.71–0.83, <italic>n</italic> = 23; checklist 0.81, 95% CI 0.75–0.85, <italic>n</italic> = 21), whereas GRS inter‐item reliabilities (0.92, 95% CI 0.84–0.95, <italic>n</italic> = 6) and inter‐station reliabilities (0.80, 95% CI 0.73–0.85, <italic>n</italic> = 10) were higher than those for checklists (0.66, 95% CI 0–0.84, <italic>n</italic> = 4 and 0.69, 95% CI 0.56–0.77, <italic>n</italic> = 10, respectively). Content evidence for GRSs usually referenced previously reported instruments (<italic>n</italic> = 33), whereas content evidence for checklists usually described expert consensus (<italic>n</italic> = 26). Checklists and GRSs usually had similar evidence for relations to other variables.</p> </sec> <sec id="medu12621-sec-0004" sec-type="section"> <title>Conclusions</title> <p>Checklist inter‐rater reliability and trainee discrimination were more favourable than suggested in earlier work, but each task requires a separate checklist. Compared with the checklist, the GRS has higher average inter‐item and inter‐station reliability, can be used across multiple tasks, and may better capture nuanced elements of expertise.</p> </sec> </abstract> … (more)
- Is Part Of:
- Medical education. Volume 49:Issue 2(2015)
- Journal:
- Medical education
- Issue:
- Volume 49:Issue 2(2015)
- Issue Display:
- Volume 49, Issue 2 (2015)
- Year:
- 2015
- Volume:
- 49
- Issue:
- 2
- Issue Sort Value:
- 2015-0049-0002-0000
- Page Start:
- 161
- Page End:
- 173
- Publication Date:
- 2015-02
- Subjects:
- Medical education -- Periodicals
Medical education -- Great Britain -- Periodicals
610.7 - Journal URLs:
- http://www.blackwell-synergy.com/member/institutions/issuelist.asp?journal=med ↗
http://www.blackwellpublishing.com/journal.asp?ref=0308-0110 ↗
http://onlinelibrary.wiley.com/journal/10.1111/(ISSN)1365-2923 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1111/medu.12621 ↗
- Languages:
- English
- ISSNs:
- 0308-0110
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5527.166000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 3252.xml