Validation of a machine learning approach to estimate Systemic Lupus Erythematosus Disease Activity Index score categories and application in a real-world dataset. Issue 2 (20th May 2021)
- Record Type:
- Journal Article
- Title:
- Validation of a machine learning approach to estimate Systemic Lupus Erythematosus Disease Activity Index score categories and application in a real-world dataset. Issue 2 (20th May 2021)
- Main Title:
- Validation of a machine learning approach to estimate Systemic Lupus Erythematosus Disease Activity Index score categories and application in a real-world dataset
- Authors:
- Alves, Pedro
Bandaria, Jigar
Leavy, Michelle B
Gliklich, Benjamin
Boussios, Costas
Su, Zhaohui
Curhan, Gary - Abstract:
- Abstract : Objective: Use of the Systemic Lupus Erythematosus Disease Activity Index (SLEDAI) in routine clinical practice is inconsistent, and availability of clinician-recorded SLEDAI scores in real-world datasets is limited. This study aimed to validate a machine learning model to estimate SLEDAI score categories using clinical notes and to apply the model to a large, real-world dataset to generate estimated score categories for use in future research studies. Methods: A machine learning model was developed to estimate an individual patient's SLEDAI score category (no activity, mild activity, moderate activity or high/very high activity) for a specific encounter date using clinical notes. A training cohort of 3504 encounters and a separate validation cohort of 1576 encounters were created from the OM1 SLE Registry. Model performance was assessed using the area under the receiver operating characteristic curve (AUC), calculated using a binarised version of the outcome that sets the positive class to be those records with clinician-recorded SLEDAI scores >5 and the negative class to be records with scores ≤5. Model performance was evaluated by categorising the scores into the four disease activity categories and by calculating the Spearman's R value and Pearson's R value. Results: The AUC for the two categories was 0.93 for the development cohort and 0.91 for the validation cohort. The model had a Spearman's R value of 0.7 and a Pearson's R value of 0.7 when calculatedAbstract : Objective: Use of the Systemic Lupus Erythematosus Disease Activity Index (SLEDAI) in routine clinical practice is inconsistent, and availability of clinician-recorded SLEDAI scores in real-world datasets is limited. This study aimed to validate a machine learning model to estimate SLEDAI score categories using clinical notes and to apply the model to a large, real-world dataset to generate estimated score categories for use in future research studies. Methods: A machine learning model was developed to estimate an individual patient's SLEDAI score category (no activity, mild activity, moderate activity or high/very high activity) for a specific encounter date using clinical notes. A training cohort of 3504 encounters and a separate validation cohort of 1576 encounters were created from the OM1 SLE Registry. Model performance was assessed using the area under the receiver operating characteristic curve (AUC), calculated using a binarised version of the outcome that sets the positive class to be those records with clinician-recorded SLEDAI scores >5 and the negative class to be records with scores ≤5. Model performance was evaluated by categorising the scores into the four disease activity categories and by calculating the Spearman's R value and Pearson's R value. Results: The AUC for the two categories was 0.93 for the development cohort and 0.91 for the validation cohort. The model had a Spearman's R value of 0.7 and a Pearson's R value of 0.7 when calculated using the four disease activity categories. Conclusion: The model performs well when estimating SLEDAI score categories using unstructured clinical notes. … (more)
- Is Part Of:
- RMD open. Volume 7:Issue 2(2021)
- Journal:
- RMD open
- Issue:
- Volume 7:Issue 2(2021)
- Issue Display:
- Volume 7, Issue 2 (2021)
- Year:
- 2021
- Volume:
- 7
- Issue:
- 2
- Issue Sort Value:
- 2021-0007-0002-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-05-20
- Subjects:
- lupus erythematosus -- systemic -- outcome assessment -- healthcare -- epidemiology
Musculoskeletal system -- Diseases -- Periodicals
Rheumatism -- Periodicals
616.7005 - Journal URLs:
- http://www.bmj.com/archive ↗
http://rmdopen.bmj.com/ ↗ - DOI:
- 10.1136/rmdopen-2021-001586 ↗
- Languages:
- English
- ISSNs:
- 2056-5933
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 17175.xml