Competency-Based Assessments: Leveraging Artificial Intelligence to Predict Subcompetency Content. (5th December 2022)
- Record Type:
- Journal Article
- Title:
- Competency-Based Assessments: Leveraging Artificial Intelligence to Predict Subcompetency Content. (5th December 2022)
- Main Title:
- Competency-Based Assessments: Leveraging Artificial Intelligence to Predict Subcompetency Content
- Authors:
- Booth, Gregory J.
Ross, Benjamin
Cronin, William A.
McElrath, Angela
Cyr, Kyle L.
Hodgson, John A.
Sibley, Charles
Ismawan, J. Martin
Zuehl, Alyssa
Slotto, James G.
Higgs, Maureen
Haldeman, Matthew
Geiger, Phillip
Jardine, Dink - Abstract:
- Abstract : Purpose: Faculty feedback on trainees is critical to guiding trainee progress in a competency-based medical education framework. The authors aimed to develop and evaluate a Natural Language Processing (NLP) algorithm that automatically categorizes narrative feedback into corresponding Accreditation Council for Graduate Medical Education Milestone 2.0 subcompetencies. Method: Ten academic anesthesiologists analyzed 5, 935 narrative evaluations on anesthesiology trainees at 4 graduate medical education (GME) programs between July 1, 2019, and June 30, 2021. Each sentence (n = 25, 714) was labeled with the Milestone 2.0 subcompetency that best captured its content or was labeled as demographic or not useful. Inter-rater agreement was assessed by Fleiss' Kappa. The authors trained an NLP model to predict feedback subcompetencies using data from 3 sites and evaluated its performance at a fourth site. Performance metrics included area under the receiver operating characteristic curve (AUC), positive predictive value, sensitivity, F1, and calibration curves. The model was implemented at 1 site in a self-assessment exercise. Results: Fleiss' Kappa for subcompetency agreement was moderate (0.44). Model performance was good for professionalism, interpersonal and communication skills, and practice-based learning and improvement (AUC 0.79, 0.79, and 0.75, respectively). Subcompetencies within medical knowledge and patient care ranged from fair to excellent (AUC 0.66–0.84 andAbstract : Purpose: Faculty feedback on trainees is critical to guiding trainee progress in a competency-based medical education framework. The authors aimed to develop and evaluate a Natural Language Processing (NLP) algorithm that automatically categorizes narrative feedback into corresponding Accreditation Council for Graduate Medical Education Milestone 2.0 subcompetencies. Method: Ten academic anesthesiologists analyzed 5, 935 narrative evaluations on anesthesiology trainees at 4 graduate medical education (GME) programs between July 1, 2019, and June 30, 2021. Each sentence (n = 25, 714) was labeled with the Milestone 2.0 subcompetency that best captured its content or was labeled as demographic or not useful. Inter-rater agreement was assessed by Fleiss' Kappa. The authors trained an NLP model to predict feedback subcompetencies using data from 3 sites and evaluated its performance at a fourth site. Performance metrics included area under the receiver operating characteristic curve (AUC), positive predictive value, sensitivity, F1, and calibration curves. The model was implemented at 1 site in a self-assessment exercise. Results: Fleiss' Kappa for subcompetency agreement was moderate (0.44). Model performance was good for professionalism, interpersonal and communication skills, and practice-based learning and improvement (AUC 0.79, 0.79, and 0.75, respectively). Subcompetencies within medical knowledge and patient care ranged from fair to excellent (AUC 0.66–0.84 and 0.63–0.88, respectively). Performance for systems-based practice was poor (AUC 0.59). Performances for demographic and not useful categories were excellent (AUC 0.87 for both). In approximately 1 minute, the model interpreted several hundred evaluations and produced individual trainee reports with organized feedback to guide a self-assessment exercise. The model was built into a web-based application. Conclusions: The authors developed an NLP model that recognized the feedback language of anesthesiologists across multiple GME programs. The model was operationalized in a self-assessment exercise. It is a powerful tool which rapidly organizes large amounts of narrative feedback. … (more)
- Is Part Of:
- Academic medicine. Volume 98:Number 4(2023)
- Journal:
- Academic medicine
- Issue:
- Volume 98:Number 4(2023)
- Issue Display:
- Volume 98, Issue 4 (2023)
- Year:
- 2023
- Volume:
- 98
- Issue:
- 4
- Issue Sort Value:
- 2023-0098-0004-0000
- Page Start:
- 497
- Page End:
- 504
- Publication Date:
- 2022-12-05
- Subjects:
- Medical education -- Periodicals
Medical policy -- Periodicals
Medical personnel -- Periodicals
Periodicals
610.711 - Journal URLs:
- http://gateway.ovid.com/ovidweb.cgi?T=JS&MODE=ovid&PAGE=toc&D=ovft&AN=00001888-000000000-00000 ↗
http://www.academicmedicine.org ↗
http://www.academicmedicine.org/contents-by-date.0.shtml ↗
http://journals.lww.com ↗ - DOI:
- 10.1097/ACM.0000000000005115 ↗
- Languages:
- English
- ISSNs:
- 1040-2446
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 0570.513500
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 26797.xml