A simple new approach to variable selection in regression, with application to genetic fine mapping. (10th July 2020)
- Record Type:
- Journal Article
- Title:
- A simple new approach to variable selection in regression, with application to genetic fine mapping. (10th July 2020)
- Main Title:
- A simple new approach to variable selection in regression, with application to genetic fine mapping
- Authors:
- Wang, Gao
Sarkar, Abhishek
Carbonetto, Peter
Stephens, Matthew - Abstract:
- Summary: We introduce a simple new approach to variable selection in linear regression, with a particular focus on quantifying uncertainty in which variables should be selected . The approach is based on a new model—the 'sum of single effects' model, called ' SuSiE '—which comes from writing the sparse vector of regression coefficients as a sum of 'single‐effect' vectors, each with one non‐zero element. We also introduce a corresponding new fitting procedure—iterative Bayesian stepwise selection (IBSS)—which is a Bayesian analogue of stepwise selection methods. IBSS shares the computational simplicity and speed of traditional stepwise methods but, instead of selecting a single variable at each step, IBSS computes a distribution on variables that captures uncertainty in which variable to select. We provide a formal justification of this intuitive algorithm by showing that it optimizes a variational approximation to the posterior distribution under SuSiE. Further, this approximate posterior distribution naturally yields convenient novel summaries of uncertainty in variable selection, providing a credible set of variables for each selection. Our methods are particularly well suited to settings where variables are highly correlated and detectable effects are sparse, both of which are characteristics of genetic fine mapping applications. We demonstrate through numerical experiments that our methods outperform existing methods for this task, and we illustrate their application toSummary: We introduce a simple new approach to variable selection in linear regression, with a particular focus on quantifying uncertainty in which variables should be selected . The approach is based on a new model—the 'sum of single effects' model, called ' SuSiE '—which comes from writing the sparse vector of regression coefficients as a sum of 'single‐effect' vectors, each with one non‐zero element. We also introduce a corresponding new fitting procedure—iterative Bayesian stepwise selection (IBSS)—which is a Bayesian analogue of stepwise selection methods. IBSS shares the computational simplicity and speed of traditional stepwise methods but, instead of selecting a single variable at each step, IBSS computes a distribution on variables that captures uncertainty in which variable to select. We provide a formal justification of this intuitive algorithm by showing that it optimizes a variational approximation to the posterior distribution under SuSiE. Further, this approximate posterior distribution naturally yields convenient novel summaries of uncertainty in variable selection, providing a credible set of variables for each selection. Our methods are particularly well suited to settings where variables are highly correlated and detectable effects are sparse, both of which are characteristics of genetic fine mapping applications. We demonstrate through numerical experiments that our methods outperform existing methods for this task, and we illustrate their application to fine mapping genetic variants influencing alternative splicing in human cell lines. We also discuss the potential and challenges for applying these methods to generic variable‐selection problems. … (more)
- Is Part Of:
- Journal of the Royal Statistical Society. Volume 82:Number 5(2020)
- Journal:
- Journal of the Royal Statistical Society
- Issue:
- Volume 82:Number 5(2020)
- Issue Display:
- Volume 82, Issue 5 (2020)
- Year:
- 2020
- Volume:
- 82
- Issue:
- 5
- Issue Sort Value:
- 2020-0082-0005-0000
- Page Start:
- 1273
- Page End:
- 1300
- Publication Date:
- 2020-07-10
- Subjects:
- Genetic fine mapping -- Linear regression -- Sparsity -- Variable selection -- Variational inference
Statistics -- Periodicals
Great Britain -- Statistics -- Periodicals
519.2 - Journal URLs:
- http://www.blackwellpublishing.com/journal.asp?ref=1369-7412 ↗
https://rss.onlinelibrary.wiley.com/journal/14679868 ↗
https://academic.oup.com/jrsssb ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1111/rssb.12388 ↗
- Languages:
- English
- ISSNs:
- 1369-7412
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4867.020000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 14685.xml