A Bootstrap Metropolis–Hastings Algorithm for Bayesian Analysis of Big Data. Issue 3 (2nd July 2016)
- Record Type:
- Journal Article
- Title:
- A Bootstrap Metropolis–Hastings Algorithm for Bayesian Analysis of Big Data. Issue 3 (2nd July 2016)
- Main Title:
- A Bootstrap Metropolis–Hastings Algorithm for Bayesian Analysis of Big Data
- Authors:
- Liang, Faming
Kim, Jinsu
Song, Qifan - Abstract:
- Abstract : Markov chain Monte Carlo (MCMC) methods have proven to be a very powerful tool for analyzing data of complex structures. However, their computer-intensive nature, which typically require a large number of iterations and a complete scan of the full dataset for each iteration, precludes their use for big data analysis. In this article, we propose the so-called bootstrap Metropolis–Hastings (BMH) algorithm that provides a general framework for how to tame powerful MCMC methods to be used for big data analysis, that is, to replace the full data log-likelihood by a Monte Carlo average of the log-likelihoods that are calculated in parallel from multiple bootstrap samples. The BMH algorithm possesses an embarrassingly parallel structure and avoids repeated scans of the full dataset in iterations, and is thus feasible for big data problems. Compared to the popular divide-and-combine method, BMH can be generally more efficient as it can asymptotically integrate the whole data information into a single simulation run. The BMH algorithm is very flexible. Like the Metropolis–Hastings algorithm, it can serve as a basic building block for developing advanced MCMC algorithms that are feasible for big data problems. This is illustrated in the article by the tempering BMH algorithm, which can be viewed as a combination of parallel tempering and the BMH algorithm. BMH can also be used for model selection and optimization by combining with reversible jump MCMC and simulatedAbstract : Markov chain Monte Carlo (MCMC) methods have proven to be a very powerful tool for analyzing data of complex structures. However, their computer-intensive nature, which typically require a large number of iterations and a complete scan of the full dataset for each iteration, precludes their use for big data analysis. In this article, we propose the so-called bootstrap Metropolis–Hastings (BMH) algorithm that provides a general framework for how to tame powerful MCMC methods to be used for big data analysis, that is, to replace the full data log-likelihood by a Monte Carlo average of the log-likelihoods that are calculated in parallel from multiple bootstrap samples. The BMH algorithm possesses an embarrassingly parallel structure and avoids repeated scans of the full dataset in iterations, and is thus feasible for big data problems. Compared to the popular divide-and-combine method, BMH can be generally more efficient as it can asymptotically integrate the whole data information into a single simulation run. The BMH algorithm is very flexible. Like the Metropolis–Hastings algorithm, it can serve as a basic building block for developing advanced MCMC algorithms that are feasible for big data problems. This is illustrated in the article by the tempering BMH algorithm, which can be viewed as a combination of parallel tempering and the BMH algorithm. BMH can also be used for model selection and optimization by combining with reversible jump MCMC and simulated annealing, respectively. Supplementary materials for this article are available online. … (more)
- Is Part Of:
- Technometrics. Volume 58:Issue 3(2016)
- Journal:
- Technometrics
- Issue:
- Volume 58:Issue 3(2016)
- Issue Display:
- Volume 58, Issue 3 (2016)
- Year:
- 2016
- Volume:
- 58
- Issue:
- 3
- Issue Sort Value:
- 2016-0058-0003-0000
- Page Start:
- 304
- Page End:
- 318
- Publication Date:
- 2016-07-02
- Subjects:
- Big data -- Bootstrap -- Markov chain Monte Carlo -- Metropolis–Hastings -- Parallel computing.
Statistical physics -- Periodicals
Chemistry -- Statistical methods -- Periodicals
Engineering -- Statistical methods -- Periodicals
519.5 - Journal URLs:
- http://pubs.amstat.org/loi/tech ↗
http://www.tandf.co.uk/journals/UTCH ↗
http://www.tandfonline.com/toc/utch20/current ↗
http://www.tandfonline.com/ ↗
http://www.ingentaconnect.com/content/asa/tech ↗ - DOI:
- 10.1080/00401706.2016.1142905 ↗
- Languages:
- English
- ISSNs:
- 0040-1706
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8761.050000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 1221.xml