Imputation and characterization of uncoded self-harm in major mental illness using machine learning. (24th October 2019)

Record Type:: Journal Article
Title:: Imputation and characterization of uncoded self-harm in major mental illness using machine learning. (24th October 2019)
Main Title:: Imputation and characterization of uncoded self-harm in major mental illness using machine learning
Authors:: Kumar, Praveen
Nestsiarovich, Anastasiya
Nelson, Stuart J
Kerner, Berit
Perkins, Douglas J
Lambert, Christophe G
Abstract:: Abstract: Objective: We aimed to impute uncoded self-harm in administrative claims data of individuals with major mental illness (MMI), characterize self-harm incidence, and identify factors associated with coding bias. Materials and Methods: The IBM MarketScan database (2003-2016) was used to analyze visit-level self-harm in 10 120 030 patients with ≥2 MMI codes. Five machine learning (ML) classifiers were tested on a balanced data subset, with XGBoost selected for the full dataset. Classification performance was validated via random data mislabeling and comparison with a clinician-derived "gold standard." The incidence of coded and imputed self-harm was characterized by year, patient age, sex, U.S. state, and MMI diagnosis. Results: Imputation identified 1 592 703 self-harm events vs 83 113 coded events, with areas under the curve >0.99 for the balanced and full datasets, and 83.5% agreement with the gold standard. The overall coded and imputed self-harm incidence were 0.28% and 5.34%, respectively, varied considerably by age and sex, and was highest in individuals with multiple MMI diagnoses. Self-harm undercoding was higher in male than in female individuals and increased with age. Substance abuse, injuries, poisoning, asphyxiation, brain disorders, harmful thoughts, and psychotherapy were the main features used by ML to classify visits. Discussion: Only 1 of 19 self-harm events was coded for individuals with MMI. ML demonstrated excellent performance in recovering … (more)
Is Part Of:: Journal of the American Medical Informatics Association. Volume 27:Number 1(2020)
Journal:: Journal of the American Medical Informatics Association
Issue:: Volume 27:Number 1(2020)
Issue Display:: Volume 27, Issue 1 (2020)
Year:: 2020
Volume:: 27
Issue:: 1
Issue Sort Value:: 2020-0027-0001-0000
Page Start:: 136
Page End:: 146
Publication Date:: 2019-10-24
Subjects:: self-harm -- suicide -- machine learning -- coding -- electronic health records
Medical informatics -- Periodicals
Information Services -- Periodicals
Medical Informatics -- Periodicals
Médecine -- Informatique -- Périodiques
Informatica
Geneeskunde
Informatique médicale
Computer network resources
Electronic journals
610.285
Journal URLs:: http://jamia.bmj.com/ ↗
http://www.jamia.org ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=76 ↗
http://www.sciencedirect.com/science/journal/10675027 ↗
http://jamia.oxfordjournals.org/ ↗
http://www.oxfordjournals.org/en/ ↗
DOI:: 10.1093/jamia/ocz173 ↗
Languages:: English
ISSNs:: 1067-5027
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 4689.025000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
Ingest File:: 15130.xml