A Framework for Improving the Reliability of Black-box Variational Inference

Loading...
Thumbnail Image

Access rights

openAccess

URL

Journal Title

Journal ISSN

Volume Title

A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä

Date

2024

Major/Subject

Mcode

Degree programme

Language

en

Pages

Series

Journal of Machine Learning Research, Volume 25, pp. 1−71

Abstract

Black-box variational inference (BBVI) now sees widespread use in machine learning and statistics as a fast yet flexible alternative to Markov chain Monte Carlo methods for approximate Bayesian inference. However, stochastic optimization methods for BBVI remain unreliable and require substantial expertise and hand-tuning to apply effectively. In this paper, we propose robust and automated black-box VI (RABVI), a framework for improving the reliability of BBVI optimization. RABVI is based on rigorously justified automation techniques, includes just a small number of intuitive tuning parameters, and detects inaccurate estimates of the optimal variational approximation. RABVI adaptively decreases the learning rate by detecting convergence of the fixed--learning-rate iterates, then estimates the symmetrized Kullback--Leibler (KL) divergence between the current variational approximation and the optimal one. It also employs a novel optimization termination criterion that enables the user to balance desired accuracy against computational cost by comparing (i) the predicted relative decrease in the symmetrized KL divergence if a smaller learning were used and (ii) the predicted computation required to converge with the smaller learning rate. We validate the robustness and accuracy of RABVI through carefully designed simulation studies and on a diverse set of real-world model and data examples.

Description

Keywords

Bayesian computation, Monte Carlo, Diagnostics, Importance sampling

Other note

Citation

Welandawe, M, Riis Andersen, M, Vehtari, A & Huggins, J H 2024, ' A Framework for Improving the Reliability of Black-box Variational Inference ', Journal of Machine Learning Research, vol. 25, pp. 1−71 . < https://www.jmlr.org/papers/v25/22-0327.html >