Bayesian Computation Via Markov Chain Monte Carlo

Radu V. Craiu; Jeffrey S. Rosenthal

doi:10.1146/annurev-statistics-022513-115540

Annual Review of Statistics and Its Application

Volume 1, 2014

Review Article

Free

Bayesian Computation Via Markov Chain Monte Carlo

Radu V. Craiu¹, and Jeffrey S. Rosenthal¹
View Affiliations Hide Affiliations

Affiliations: Department of Statistics, University of Toronto, Toronto, Ontario M5S SG3, Canada; email: [email protected]
Vol. 1:179-201 (Volume publication date January 2014) https://doi.org/10.1146/annurev-statistics-022513-115540
First published as a Review in Advance on October 30, 2013
© Annual Reviews

Abstract

Markov chain Monte Carlo (MCMC) algorithms are an indispensable tool for performing Bayesian inference. This review discusses widely used sampling algorithms and illustrates their implementation on a probit regression model for lupus data. The examples considered highlight the importance of tuning the simulation parameters and underscore the important contributions of modern developments such as adaptive MCMC. We then use the theory underlying MCMC to explain the validity of the algorithms considered and to assess the variance of the resulting Monte Carlo estimators.

Keyword(s): adaptive MCMC, Gibbs sampler, Markov chain Monte Carlo, Metropolis sampler, parallel tempering

Article metrics loading...

/content/journals/10.1146/annurev-statistics-022513-115540

2014-01-03

2024-05-14

Full text loading...

/deliver/fulltext/statistics/1/1/annurev-statistics-022513-115540.html?itemId=/content/journals/10.1146/annurev-statistics-022513-115540&mimeType=html&fmt=ahah

Literature Cited

Adler SL. 1981. Over-relaxation methods for the Monte Carlo evaluation of the partition function for multiquadratic actions. Phys. Rev. D 23:2901–4 [Google Scholar]
Albert JH, Chib S. 1993. Bayesian analysis of binary and polychotomous response data. J. Am. Stat. Assoc. 88:669–79 [Google Scholar]
Amit Y. 1991. On rates of convergence of stochastic relaxation for Gaussian and non-Gaussian distributions. J. Multivar. Anal. 38:82–100 [Google Scholar]
Amit Y. 1996. Convergence properties of the Gibbs sampler for perturbations of Gaussians. Ann. Stat. 24:122–40 [Google Scholar]
Andrieu C, Moulines E, Priouret P. 2005. Stability of stochastic approximation under verifiable conditions. SIAM J. Control Optim. 44:283–312 [Google Scholar]
Bai Y, Craiu RV, DiNarzo AF. 2011. Divide and conquer: a mixture-based approach to regional adaptation for MCMC. J. Comput. Graph. Stat. 20:63–79 [Google Scholar]
Barone P, Frigessi A. 1990. Improving stochastic relaxation for Gaussian random fields. Probab. Eng. Inf. Sci. 4:369–89 [Google Scholar]
Bedard M. 2006. On the robustness of optimal scaling for random walk Metropolis algorithms PhD Thesis, Department of Statistics, Univ. Toronto
Brooks S, Gelman A, Jones GL, Meng X-L. 2011. Handbook of Markov Chain Monte Carlo Boca Raton, FL: Chapman & Hall/CRC
Brooks SP, Gelman A. 1998. General methods for monitoring convergence of iterative simulations. J. Comput. Graph. Stat. 7:434–55 [Google Scholar]
Casarin R, Craiu RV, Leisen F. 2013. Interacting multiple try algorithms with different proposal distributions. Stat. Comput. 23:185–200 [Google Scholar]
Chen M-H, Shao Q-M, Ibrahim JG. 2000. Monte Carlo Methods in Bayesian Computation New York: Springer
Craiu RV, Lemieux C. 2007. Acceleration of the multiple-try Metropolis algorithm using antithetic and stratified sampling. Stat. Comput. 17:109–20 [Google Scholar]
Craiu RV, Meng X-L. 2005. Multi-process parallel antithetic coupling for forward and backward MCMC. Ann. Stat. 33:661–97 [Google Scholar]
Craiu RV, Meng X-L. 2011. Perfection within reach: exact MCMC sampling. Handbook of Markov Chain Monte Carlo S Brooks, A Gelman, GL Jones, X-L Meng 199–226 Boca Raton, FL: Chapman & Hall/CRC [Google Scholar]
Craiu RV, Rosenthal JS, Yang C. 2009. Learn from thy neighbor: parallel-chain adaptive and regional MCMC. J. Am. Stat. Assoc. 104:1454–66 [Google Scholar]
Douc R, Moulines E, Rosenthal JS. 2004. Quantitative bounds on convergence of time-inhomogeneous Markov chains. Ann. Appl. Probab. 14:1643–65 [Google Scholar]
Flegal JM, Haran M, Jones GL. 2008. Markov chain Monte Carlo: Can we trust the third significant figure?. Stat. Sci. 23:250–60 [Google Scholar]
Gelfand AE, Smith AFM. 1990. Sampling-based approaches to calculating marginal densities. J. Am. Stat. Assoc. 85:398–409 [Google Scholar]
Gelman A, Rubin DB. 1992. Inference from iterative simulation using multiple sequences. Stat. Sci. 7:457–72 [Google Scholar]
Geman S, Geman D. 1984. Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell. 6:721–41 [Google Scholar]
Geyer CJ. 1992. Practical Markov chain Monte Carlo (with discussion). Stat. Sci. 7:473–83 [Google Scholar]
Green PJ. 1995. Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika 82:711–32 [Google Scholar]
Green PJ, Mira A. 2001. Delayed rejection in reversible jump Metropolis-Hastings. Biometrika 88:1035–53 [Google Scholar]
Haario H, Saksman E, Tamminen J. 2001. An adaptive Metropolis algorithm. Bernoulli 7:223–42 [Google Scholar]
Hastings WK. 1970. Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57:97–109 [Google Scholar]
Jones G, Hobert J. 2001. Honest exploration of intractable probability distributions via Markov chain Monte Carlo. Stat. Sci. 16:312–34 [Google Scholar]
Liu JS. 2001. Monte Carlo Strategies in Scientific Computing New York: Springer
Liu JS, Liang F, Wong WH. 2000. The multiple-try method and local optimization in Metropolis sampling. J. Am. Stat. Assoc. 95:121–34 [Google Scholar]
Liu JS, Wong WH, Kong A. 1994. Covariance structure of the Gibbs sampler with applications to the comparisons of estimators and augmentation schemes. Biometrika 81:27–40 [Google Scholar]
Liu JS, Wong WH, Kong A. 1995. Covariance structure and convergence rate of the Gibbs sampler with various scans. J. R. Stat. Soc. B 57:157–69 [Google Scholar]
Liu JS, Wu YN. 1999. Parameter expansion for data augmentation. J. Am. Stat. Assoc. 94:1264–74 [Google Scholar]
Meng X-L, van Dyk DA. 1999. Seeking efficient data augmentation schemes via conditional and marginal augmentation. Biometrika 86:301–20 [Google Scholar]
Mengersen KL, Tweedie RL. 1996. Rates of convergence of the Hastings and Metropolis algorithms. Ann. Stat. 24:101–21 [Google Scholar]
Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E. 1953. Equations of state calculations by fast computing machines. J. Chem. Phys. 21:1087–92 [Google Scholar]
Meyn SP, Tweedie RL. 1993. Markov Chains and Stochastic Stability London: Springer-Verlag
Neal RM. 1995. Suppressing random walks in Markov chain Monte Carlo using ordered overrelaxation Tech. Rep. 9508, Univ. Toronto. Dep. Stat., Toronto, Can. http://arxiv.org/pdf/bayes-an/9506004v1.pdf
Papaspiliopoulos O, Roberts GO. 2008. Stability of the Gibbs sampler for Bayesian hierarchical models. Ann. Stat. 36:95–117 [Google Scholar]
Propp JG, Wilson DB. 1996. Exact sampling with coupled Markov chains and applications to statistical mechanics. Random Struct. Algorithms 9:223–52 [Google Scholar]
Richardson S, Green PJ. 1997. On Bayesian analysis of mixtures with an unknown number of components (with discussion). J. R. Stat. Soc. B 59:731–92 [Google Scholar]
Robert CP, Casella G. 2004. Monte Carlo Statistical Methods New York: Springer
Robert CP, Casella G. 2010. Introducing Monte Carlo Methods with R New York: Springer
Roberts GO, Gelman A, Gilks WR. 1997. Weak convergence and optimal scaling of random walk Metropolis algorithms. Ann. Appl. Probab. 7:110–20 [Google Scholar]
Roberts GO, Rosenthal JS. 1997. Geometric ergodicity and hybrid Markov chains. Electron. Commun. Probab. 2:213–25 [Google Scholar]
Roberts GO, Rosenthal JS. 2001. Optimal scaling for various Metropolis-Hastings algorithms. Stat. Sci. 16:351–67 [Google Scholar]
Roberts GO, Rosenthal JS. 2004. General state space Markov chains and MCMC algorithms. Probab. Surv. 1:20–71 [Google Scholar]
Roberts GO, Rosenthal JS. 2007. Coupling and ergodicity of adaptive Markov chain Monte Carlo algorithms. J. Appl. Probab. 44:458–75 [Google Scholar]
Roberts GO, Rosenthal JS. 2009. Examples of adaptive MCMC. J. Comput. Graph. Stat. 18:349–67 [Google Scholar]
Roberts GO, Tweedie RL. 1996. Geometric convergence and central limit theorems for multidimensional Hastings and Metropolis algorithms. Biometrika 83:95–110 [Google Scholar]
Rosenthal JS. 1995. Minorization conditions and convergence rates for Markov chain Monte Carlo. J. Am. Stat. Assoc. 90:558–66 [Google Scholar]
Rosenthal JS. 2001. A review of asymptotic convergence for general state space Markov chains. Far East J. Theor. Stat. 5:37–50 [Google Scholar]
Rosenthal JS. 2002. Quantitative convergence rates of Markov chains: a simple account. Electron. Commun. Probab. 7:123–28 [Google Scholar]
Rosenthal JS, Roberts GO. 2011. Quantitative non-geometric convergence bounds for independence samplers. Methodol. Comput. Appl. Probab. 13:391–403 [Google Scholar]
Spiegelhalter DJ, Best NG, Carlin BP, Van Der Linde A. 2002. Bayesian measures of model complexity and fit (with discussion). J. R. Stat. Soc. B 64:583–639 [Google Scholar]
Tanner MA, Wong WH. 1987. The calculation of posterior distributions by data augmentation. J. Am. Stat. Assoc. 82:528–40 [Google Scholar]
Tierney L. 1994. Markov chains for exploring posterior distributions. Ann. Stat. 22:1701–28 [Google Scholar]
van Dyk DA, Meng X-L. 2001. The art of data augmentation (with discussion). J. Comput. Graph. Stat. 10:1–111 [Google Scholar]

/content/journals/10.1146/annurev-statistics-022513-115540

Bayesian Computation Via Markov Chain Monte Carlo

Annual Review of Statistics and Its Application 1, 179 (2014); https://doi.org/10.1146/annurev-statistics-022513-115540

/content/journals/10.1146/annurev-statistics-022513-115540

Data & Media loading...

Article Type: Review Article

Most Cited Most Cited RSS feed

- Functional Data Analysis
  
  Jane-Ling Wang, Jeng-Min Chiou, and Hans-Georg Müller
  
  Vol. 3 (2016), pp. 257–295
- Probabilistic Forecasting
  
  Tilmann Gneiting, and Matthias Katzfuss
  
  Vol. 1 (2014), pp. 125–151
- Bayesian Computing with INLA: A Review
  
  Håvard Rue, Andrea Riebler, Sigrunn H. Sørbye, Janine B. Illian, Daniel P. Simpson, and Finn K. Lindgren
  
  Vol. 4 (2017), pp. 395–421
- Functional Regression
  
  Jeffrey S. Morris
  
  Vol. 2 (2015), pp. 321–359
- Topological Data Analysis
  
  Larry Wasserman
  
  Vol. 5 (2018), pp. 501–532
- Algorithmic Fairness: Choices, Assumptions, and Definitions
  
  Shira Mitchell, Eric Potash, Solon Barocas, Alexander D'Amour, and Kristian Lum
  
  Vol. 8 (2021), pp. 141–163
- Microbiome, Metagenomics, and High-Dimensional Compositional Data Analysis
  
  Hongzhe Li
  
  Vol. 2 (2015), pp. 73–94
- Learning Deep Generative Models
  
  Ruslan Salakhutdinov
  
  Vol. 2 (2015), pp. 361–385
- On p-Values and Bayes Factors
  
  Leonhard Held, and Manuela Ott
  
  Vol. 5 (2018), pp. 393–419
- High-Dimensional Statistics with a View Toward Applications in Biology
  
  Peter Bühlmann, Markus Kalisch, and Lukas Meier
  
  Vol. 1 (2014), pp. 255–278
More Less

Annual Review of Statistics and Its Application

Volume 1, 2014

Review Article

Free

Bayesian Computation Via Markov Chain Monte Carlo

Abstract

Most Read This Month

Most Cited Most Cited RSS feed