Comparing machine and human reviewers to evaluate the risk of bias in randomized controlled trials. (3rd March 2020)
- Record Type:
- Journal Article
- Title:
- Comparing machine and human reviewers to evaluate the risk of bias in randomized controlled trials. (3rd March 2020)
- Main Title:
- Comparing machine and human reviewers to evaluate the risk of bias in randomized controlled trials
- Authors:
- Armijo‐Olivo, Susan
Craig, Rodger
Campbell, Sandy - Abstract:
- Abstract : Background: Evidence from new health technologies is growing, along with demands for evidence to inform policy decisions, creating challenges in completing health technology assessments (HTAs)/systematic reviews (SRs) in a timely manner. Software can decrease the time and burden by automating the process, but evidence validating such software is limited. We tested the accuracy of RobotReviewer, a semi‐autonomous risk of bias (RoB) assessment tool, and its agreement with human reviewers. Methods: Two reviewers independently conducted RoB assessments on a sample of randomized controlled trials (RCTs), and their consensus ratings were compared with those generated by RobotReviewer. Agreement with the human reviewers was assessed using percent agreement and weighted kappa ( κ ). The accuracy of RobotReviewer was also assessed by calculating the sensitivity, specificity, and area under the curve in comparison to the consensus agreement of the human reviewers. Results: The study included 372 RCTs. Inter‐rater reliability ranged from κ = −0.06 (no agreement) for blinding of participants and personnel to κ = 0.62 (good agreement) for random sequence generation (excluding overall RoB). RobotReviewer was found to use a high percentage of "irrelevant supporting quotations" to complement RoB assessments for blinding of participants and personnel (72.6%), blinding of outcome assessment (70.4%), and allocation concealment (54.3%). Conclusion: RobotReviewer can help with risk ofAbstract : Background: Evidence from new health technologies is growing, along with demands for evidence to inform policy decisions, creating challenges in completing health technology assessments (HTAs)/systematic reviews (SRs) in a timely manner. Software can decrease the time and burden by automating the process, but evidence validating such software is limited. We tested the accuracy of RobotReviewer, a semi‐autonomous risk of bias (RoB) assessment tool, and its agreement with human reviewers. Methods: Two reviewers independently conducted RoB assessments on a sample of randomized controlled trials (RCTs), and their consensus ratings were compared with those generated by RobotReviewer. Agreement with the human reviewers was assessed using percent agreement and weighted kappa ( κ ). The accuracy of RobotReviewer was also assessed by calculating the sensitivity, specificity, and area under the curve in comparison to the consensus agreement of the human reviewers. Results: The study included 372 RCTs. Inter‐rater reliability ranged from κ = −0.06 (no agreement) for blinding of participants and personnel to κ = 0.62 (good agreement) for random sequence generation (excluding overall RoB). RobotReviewer was found to use a high percentage of "irrelevant supporting quotations" to complement RoB assessments for blinding of participants and personnel (72.6%), blinding of outcome assessment (70.4%), and allocation concealment (54.3%). Conclusion: RobotReviewer can help with risk of bias assessment of RCTs but cannot replace human evaluations. Thus, reviewers should check and validate RoB assessments from RobotReviewer by consulting the original article when not relevant supporting quotations are provided by RobotReviewer. This consultation is in line with the recommendation provided by the developers. … (more)
- Is Part Of:
- Research synthesis methods. Volume 11:Number 3(2020)
- Journal:
- Research synthesis methods
- Issue:
- Volume 11:Number 3(2020)
- Issue Display:
- Volume 11, Issue 3 (2020)
- Year:
- 2020
- Volume:
- 11
- Issue:
- 3
- Issue Sort Value:
- 2020-0011-0003-0000
- Page Start:
- 484
- Page End:
- 493
- Publication Date:
- 2020-03-03
- Subjects:
- artificial intelligence -- health technology assessment (HTA) -- inter‐rater reliability -- randomized controlled trial -- risk of bias -- systematic review
Research -- Methodology -- Periodicals
Research -- Statistical methods -- Periodicals
507.2 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1759-2887 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/jrsm.1398 ↗
- Languages:
- English
- ISSNs:
- 1759-2879
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 7773.705700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 23091.xml