A combined strategy of feature selection and machine learning to identify predictors of prediabetes. (30th December 2019)

Record Type:: Journal Article
Title:: A combined strategy of feature selection and machine learning to identify predictors of prediabetes. (30th December 2019)
Main Title:: A combined strategy of feature selection and machine learning to identify predictors of prediabetes
Authors:: De Silva, Kushan
Jönsson, Daniel
Demmer, Ryan T
Abstract:: Objective: To identify predictors of prediabetes using feature selection and machine learning on a nationally representative sample of the US population. Materials and Methods: We analyzed n = 6346 men and women enrolled in the National Health and Nutrition Examination Survey 2013–2014. Prediabetes was defined using American Diabetes Association guidelines. The sample was randomly partitioned to training (n = 3174) and internal validation (n = 3172) sets. Feature selection algorithms were run on training data containing 156 preselected exposure variables. Four machine learning algorithms were applied on 46 exposure variables in original and resampled training datasets built using 4 resampling methods. Predictive models were tested on internal validation data (n = 3172) and external validation data (n = 3000) prepared from National Health and Nutrition Examination Survey 2011–2012. Model performance was evaluated using area under the receiver operating characteristic curve (AUROC). Predictors were assessed by odds ratios in logistic models and variable importance in others. The Centers for Disease Control (CDC) prediabetes screening tool was the benchmark to compare model performance. Results: Prediabetes prevalence was 23.43%. The CDC prediabetes screening tool produced 64.40% AUROC. Seven optimal (≥ 70% AUROC) models identified 25 predictors including 4 potentially novel associations; 20 by both logistic and other nonlinear/ensemble models and 5 solely by the …
Is Part Of:: Journal of the American Medical Informatics Association. Volume 27:Number 3(2020)
Journal:: Journal of the American Medical Informatics Association
Issue:: Volume 27:Number 3(2020)
Issue Display:: Volume 27, Issue 3 (2020)
Year:: 2020
Volume:: 27
Issue:: 3
Issue Sort Value:: 2020-0027-0003-0000
Page Start:: 396
Page End:: 406
Publication Date:: 2019-12-30
Subjects:: prediabetes -- predictors -- machine learning -- feature selection -- NHANES
Medical informatics -- Periodicals
Information Services -- Periodicals
Medical Informatics -- Periodicals
Médecine -- Informatique -- Périodiques
Informatica
Geneeskunde
Informatique médicale
Computer network resources
Electronic journals
610.285
Journal URLs:: http://jamia.bmj.com/
http://www.jamia.org
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=76
http://www.sciencedirect.com/science/journal/10675027
http://jamia.oxfordjournals.org/
http://www.oxfordjournals.org/en/
DOI:: 10.1093/jamia/ocz204
Languages:: English
ISSNs:: 1067-5027
Deposit Type:: Legal deposit
View Content:: Available online (eLD content is only available in our Reading Rooms)
Physical Locations:: British Library DSC - 4689.025000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
Ingest File:: 15174.xml