Efficient posterior sampling for high-dimensional imbalanced logistic regression. (17th June 2020)
- Record Type:
- Journal Article
- Title:
- Efficient posterior sampling for high-dimensional imbalanced logistic regression. (17th June 2020)
- Main Title:
- Efficient posterior sampling for high-dimensional imbalanced logistic regression
- Authors:
- Sen, Deborshee
Sachs, Matthias
Lu, Jianfeng
Dunson, David B - Abstract:
- Summary: Classification with high-dimensional data is of widespread interest and often involves dealing with imbalanced data. Bayesian classification approaches are hampered by the fact that current Markov chain Monte Carlo algorithms for posterior computation become inefficient as the number $p$ of predictors or the number $n$ of subjects to classify gets large, because of the increasing computational time per step and worsening mixing rates. One strategy is to employ a gradient-based sampler to improve mixing while using data subsamples to reduce the per-step computational complexity. However, the usual subsampling breaks down when applied to imbalanced data. Instead, we generalize piecewise-deterministic Markov chain Monte Carlo algorithms to include importance-weighted and mini-batch subsampling. These maintain the correct stationary distribution with arbitrarily small subsamples and substantially outperform current competitors. We provide theoretical support for the proposed approach and demonstrate its performance gains in simulated data examples and an application to cancer data.
- Is Part Of:
- Biometrika. Volume 107:Number 4(2020:Dec.)
- Journal:
- Biometrika
- Issue:
- Volume 107:Number 4(2020:Dec.)
- Issue Display:
- Volume 107, Issue 4 (2020)
- Year:
- 2020
- Volume:
- 107
- Issue:
- 4
- Issue Sort Value:
- 2020-0107-0004-0000
- Page Start:
- 1005
- Page End:
- 1012
- Publication Date:
- 2020-06-17
- Subjects:
- Imbalanced data -- Logistic regression -- Piecewise-deterministic Markov process -- Scalable inference -- Subsampling
Biometry -- Periodicals
570.1519505 - Journal URLs:
- http://www.oup.co.uk/biomet/contents ↗
http://biomet.oxfordjournals.org ↗
http://www.jstor.org/journals/00063444.html ↗
http://ukcatalogue.oup.com/ ↗
http://firstsearch.oclc.org ↗
http://www.ingenta.com/journals/browse/oup/biomet?mode=direct ↗ - DOI:
- 10.1093/biomet/asaa035 ↗
- Languages:
- English
- ISSNs:
- 0006-3444
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2089.000000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15236.xml