Evaluating the robustness of targeted maximum likelihood estimators via realistic simulations in nutrition intervention trials. (16th February 2022)
- Record Type:
- Journal Article
- Title:
- Evaluating the robustness of targeted maximum likelihood estimators via realistic simulations in nutrition intervention trials. (16th February 2022)
- Main Title:
- Evaluating the robustness of targeted maximum likelihood estimators via realistic simulations in nutrition intervention trials
- Authors:
- Li, Haodong
Rosete, Sonali
Coyle, Jeremy
Phillips, Rachael V.
Hejazi, Nima S.
Malenica, Ivana
Arnold, Benjamin F.
Benjamin‐Chung, Jade
Mertens, Andrew
Colford, John M.
van der Laan, Mark J.
Hubbard, Alan E.
- Abstract:
- Several recently developed methods have the potential to harness machine learning in the pursuit of target quantities inspired by causal inference, including inverse weighting, doubly robust estimating equations and substitution estimators like targeted maximum likelihood estimation. There are even more recent augmentations of these procedures that can increase robustness by adding a layer of cross‐validation (cross‐validated targeted maximum likelihood estimation and double machine learning, as applied to substitution and estimating equation approaches, respectively). While these methods have been evaluated individually on simulated and experimental data sets, a comprehensive analysis of their performance across simulations based on real data has yet to be conducted. In this work, we benchmark multiple widely used methods for estimation of the average treatment effect using data from ten different nutrition intervention studies. A nonparametric regression method, undersmoothed highly adaptive lasso, is used to generate the simulated distribution, which preserves important features from the observed data and reproduces a set of true target parameters. For each simulated data set, we apply the methods above to estimate the average treatment effects as well as their standard errors and resulting confidence intervals. Based on the analytic results, a general recommendation is put forth for use of the cross‐validated variants of both substitution and estimating equation estimators. We conclude that the additional layer of cross‐validation helps in avoiding unintentional over‐fitting of nuisance parameter functionals and leads to more robust inferences.
- Is Part Of:
- Statistics in medicine. Volume 41:Number 12(2022)
- Journal:
- Statistics in medicine
- Issue:
- Volume 41:Number 12(2022)
- Issue Display:
- Volume 41, Issue 12 (2022)
- Year:
- 2022
- Volume:
- 41
- Issue:
- 12
- Issue Sort Value:
- 2022-0041-0012-0000
- Page Start:
- 2132
- Page End:
- 2165
- Publication Date:
- 2022-02-16
- Subjects:
- causal inference -- highly adaptive lasso -- machine learning -- realistic simulation -- targeted learning
Medical statistics -- Periodicals
Statistique médicale -- Périodiques
Statistiques médicales -- Périodiques
610.727
- Journal URLs:
- http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/sim.9348
- Languages:
- English
- ISSNs:
- 0277-6715
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 8453.576000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 21375.xml