R* : A Robust MCMC Convergence Diagnostic with Uncertainty Using Decision Tree Classifiers

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorLambert, Benen_US
dc.contributor.authorVehtari, Akien_US
dc.contributor.departmentDepartment of Computer Scienceen
dc.contributor.groupauthorComputer Science Professorsen
dc.contributor.groupauthorComputer Science - Artificial Intelligence and Machine Learning (AIML)en
dc.contributor.groupauthorProbabilistic Machine Learningen
dc.contributor.groupauthorHelsinki Institute for Information Technology (HIIT)en
dc.contributor.groupauthorProfessorship Vehtari Akien
dc.contributor.organizationImperial College Londonen_US
dc.date.accessioned2022-08-10T08:17:40Z
dc.date.available2022-08-10T08:17:40Z
dc.date.issued2022-06en_US
dc.descriptionFunding Information: The authors would like to thank the anonymous reviewers for comments on previous drafts of the paper that lead to significant improvements. We would also like to thank Paul Bürkner and Jonah Gabry, with whom useful discussions were had during the preparation of this manuscript. Publisher Copyright: © 2022. International Society for Bayesian Analysis
dc.description.abstractMarkov chain Monte Carlo (MCMC) has transformed Bayesian model inference over the past three decades: mainly because of this, Bayesian inference is now a workhorse of applied scientists. Under general conditions, MCMC sampling converges asymptotically to the posterior distribution, but this provides no guarantees about its performance in finite time. The predominant method for monitoring convergence is to run multiple chains and monitor individual chains’ characteristics and compare these to the population as a whole: if within-chain and between-chain summaries are comparable, then this is taken to indicate that the chains have converged to a common stationary distribution. Here, we introduce a new method for diagnosing convergence based on how well a machine learning classifier model can successfully discriminate the individual chains. We call this convergence measure R*. In contrast to the predominant R, R* is a single statistic across all parameters that indicates lack of mixing, although individual variables’ importance for this metric can also be determined. Additionally, R* is not based on any single characteristic of the sampling distribution; instead it uses all the information in the chain, including that given by the joint sampling distribution, which is currently largely overlooked by existing approaches. We recommend calculating R* using two different machine learning classifiers — gradient-boosted regression trees and random forests — which each work well in models of different dimensions. Because each of these methods outputs a classification probability, as a byproduct, we obtain uncertainty in R ∗. The method is straightforward to implement and could be a complementary additional check on MCMC convergence for applied analyses.en
dc.description.versionPeer revieweden
dc.format.extent27
dc.format.extent353-379
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationLambert, B & Vehtari, A 2022, ' R* : A Robust MCMC Convergence Diagnostic with Uncertainty Using Decision Tree Classifiers ', Bayesian Analysis, vol. 17, no. 2, pp. 353-379 . https://doi.org/10.1214/20-BA1252en
dc.identifier.doi10.1214/20-BA1252en_US
dc.identifier.issn1936-0975
dc.identifier.otherPURE UUID: 4497d186-2443-4429-86a1-ccf3f0055c2aen_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/4497d186-2443-4429-86a1-ccf3f0055c2aen_US
dc.identifier.otherPURE LINK: http://www.scopus.com/inward/record.url?scp=85131044234&partnerID=8YFLogxKen_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/85533802/R_A_Robust_MCMC_Convergence_Diagnostic_with_Uncertainty_Using_Decision_Tree_Classifiers.pdfen_US
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/115746
dc.identifier.urnURN:NBN:fi:aalto-202208104568
dc.language.isoenen
dc.publisherINT SOC BAYESIAN ANALYSIS
dc.relation.ispartofseriesBayesian Analysisen
dc.relation.ispartofseriesVolume 17, issue 2en
dc.rightsopenAccessen
dc.titleR* : A Robust MCMC Convergence Diagnostic with Uncertainty Using Decision Tree Classifiersen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files