Functional clustering methods for longitudinal data with application to electronic health records. (March 2021)
- Record Type:
- Journal Article
- Title:
- Functional clustering methods for longitudinal data with application to electronic health records. (March 2021)
- Main Title:
- Functional clustering methods for longitudinal data with application to electronic health records
- Authors:
- Zeldow, Bret
Flory, James
Stephens-Shields, Alisa
Raebel, Marsha
Roy, Jason A - Abstract:
- We develop a method to estimate subject-level trajectory functions from longitudinal data. The approach can be used for patient phenotyping, feature extraction, or, as in our motivating example, outcome identification, which refers to the process of identifying disease status through patient laboratory tests rather than through diagnosis codes or prescription information. We model the joint distribution of a continuous longitudinal outcome and baseline covariates using an enriched Dirichlet process prior. This joint model decomposes into (local) semiparametric linear mixed models for the outcome given the covariates and simple (local) marginals for the covariates. The nonparametric enriched Dirichlet process prior is placed on the regression and spline coefficients, the error variance, and the parameters governing the predictor space. This leads to clustering of patients based on their outcomes and covariates. We predict the outcome at unobserved time points for subjects with data at other time points as well as for new subjects with only baseline covariates. We find improved prediction over mixed models with Dirichlet process priors when there are a large number of covariates. Our method is demonstrated with electronic health records consisting of initiators of second-generation antipsychotic medications, which are known to increase the risk of diabetes. We use our model to predict laboratory values indicative of diabetes for each individual and assess incidence ofWe develop a method to estimate subject-level trajectory functions from longitudinal data. The approach can be used for patient phenotyping, feature extraction, or, as in our motivating example, outcome identification, which refers to the process of identifying disease status through patient laboratory tests rather than through diagnosis codes or prescription information. We model the joint distribution of a continuous longitudinal outcome and baseline covariates using an enriched Dirichlet process prior. This joint model decomposes into (local) semiparametric linear mixed models for the outcome given the covariates and simple (local) marginals for the covariates. The nonparametric enriched Dirichlet process prior is placed on the regression and spline coefficients, the error variance, and the parameters governing the predictor space. This leads to clustering of patients based on their outcomes and covariates. We predict the outcome at unobserved time points for subjects with data at other time points as well as for new subjects with only baseline covariates. We find improved prediction over mixed models with Dirichlet process priors when there are a large number of covariates. Our method is demonstrated with electronic health records consisting of initiators of second-generation antipsychotic medications, which are known to increase the risk of diabetes. We use our model to predict laboratory values indicative of diabetes for each individual and assess incidence of suspected diabetes from the predicted dataset. … (more)
- Is Part Of:
- Statistical methods in medical research. Volume 30:Number 3(2021)
- Journal:
- Statistical methods in medical research
- Issue:
- Volume 30:Number 3(2021)
- Issue Display:
- Volume 30, Issue 3 (2021)
- Year:
- 2021
- Volume:
- 30
- Issue:
- 3
- Issue Sort Value:
- 2021-0030-0003-0000
- Page Start:
- 655
- Page End:
- 670
- Publication Date:
- 2021-03
- Subjects:
- Outcome identification -- Bayesian nonparametrics -- prediction -- functional clustering -- Dirichlet process
Medicine -- Research -- Statistical methods -- Periodicals
Research -- Periodicals
Review Literature -- Periodicals
Statistics -- methods -- Periodicals
Médecine -- Recherche -- Méthodes statistiques -- Périodiques
610.727 - Journal URLs:
- http://smm.sagepub.com/ ↗
http://www.ingentaselect.com/rpsv/cw/arn/09622802/contp1.htm ↗
http://www.uk.sagepub.com/home.nav ↗
http://firstsearch.oclc.org ↗
http://firstsearch.oclc.org/journal=0962-2802;screen=info;ECOIP ↗ - DOI:
- 10.1177/0962280220965630 ↗
- Languages:
- English
- ISSNs:
- 0962-2802
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15299.xml