Robust and Automated Variational Inference
School of Science | Doctoral thesis (article-based) | Defence date: 2023-01-20
Unless otherwise stated, all rights belong to the author. You may download, display and print this publication for your own personal use. Commercial use is prohibited.
Language: en
Pages: 90 + app. 58
Series: Aalto University publication series DOCTORAL THESES, 196/2022
Abstract
Bayesian inference offers a sound and consistent framework for analyzing data under uncertainty. Good decision making under uncertainty requires writing elaborate probabilistic models to describe and simulate the systems we find around us, with the hope of obtaining calibrated probabilistic predictions under unseen conditions. The challenge is that inference for these models is intractable in general, which necessitates approximate inference techniques that can perform fast and accurate inference on probabilistic models. The success of Bayesian methods, and of any application built on them, depends to a large extent on the approximate inference algorithm chosen by the user. Variational inference has emerged as a popular approximate inference algorithm. It can be seen as an optimization problem where the task is to find an optimal distribution as an approximation to the true intractable posterior; this optimization requires computing fast and unbiased gradients.

The contributions of this thesis fall into two broad themes. The first is the application of variational inference to fitting a Gaussian process model where observations arrive in batches and have no numerical value, being available only as rankings within a set. Interestingly, approximating the softmax link function for multi-class Gaussian process classification can also be seen as a pairwise comparison of classes. This viewpoint helps in deriving a similar variational inference algorithm that scales Gaussian process classification to settings where the number of classes and data points is very large compared to existing algorithms.

The second part of the thesis deals with automated variational inference as a general inference tool for probabilistic programs, in the context of modern programming frameworks that use automatic differentiation to compute gradients. Recent innovations in automatic differentiation software, together with algorithmic improvements in the form of noisy unbiased gradients computed with Monte Carlo sampling and mini-batching, have made it possible to use model-agnostic, standardized, automated stochastic optimization-based algorithms and to scale them to large datasets. In settings where accurate posterior inference is important, this work shows some potential pitfalls of current practices which may lead to incorrect conclusions. It provides a wide set of diagnostic tools to evaluate whether the stochastic optimization has worked well enough and whether the obtained solution is accurate enough to be used as an approximation to the true posterior. The work concludes with a set of recommendations for the end user: either use a more expressive approximating distribution, or reparameterize the model itself in the hope of ending up with an easier posterior distribution.
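For readers who want the optimization problem spelled out, the following is the textbook formulation of variational inference (notation mine, not specific to this thesis). The variational parameters \phi of an approximating family q_\phi(\theta) are chosen to maximize the evidence lower bound (ELBO):

\mathcal{L}(\phi) = \mathbb{E}_{q_\phi(\theta)}\!\left[\log p(y,\theta) - \log q_\phi(\theta)\right] = \log p(y) - \mathrm{KL}\!\left(q_\phi(\theta)\,\|\,p(\theta \mid y)\right).

Since \log p(y) does not depend on \phi, maximizing the ELBO minimizes the KL divergence to the true posterior. With a differentiable reparameterization \theta = T_\phi(\varepsilon), \varepsilon \sim p(\varepsilon), the fast and unbiased gradients mentioned in the abstract are Monte Carlo estimates of the form

\nabla_\phi \mathcal{L}(\phi) \approx \frac{1}{S}\sum_{s=1}^{S} \nabla_\phi\!\left[\log p\big(y, T_\phi(\varepsilon_s)\big) - \log q_\phi\big(T_\phi(\varepsilon_s)\big)\right],

which automatic differentiation frameworks can evaluate for essentially any probabilistic program, optionally subsampling the data (mini-batching) inside \log p(y,\theta).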
Supervising professor
Vehtari, Aki, Prof., Aalto University, Department of Computer Science, Finland
Thesis advisor
Vehtari, Aki, Prof., Aalto University, Department of Computer Science, Finland
Parts
- [Publication 1]: Akash Kumar Dhaka, Michael Riis Andersen, Pablo Garcia Moreno, Aki Vehtari. Scalable Gaussian Process for Extreme Classification. The International Workshop on Machine Learning for Signal Processing (MLSP), Espoo, Finland, pages 1–6, October 2020. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-2020123160210. DOI: 10.1109/MLSP49062.2020.9231675
- [Publication 2]: Eero Siivola, Akash Kumar Dhaka, Michael Riis Andersen, Pablo Garcia Moreno, Javier Gonzalez, Aki Vehtari. Preferential Batch Bayesian Optimization. The International Workshop on Machine Learning for Signal Processing (MLSP), Gold Coast, Australia, November 2021. DOI: 10.1109/MLSP52302.2021.9596494
- [Publication 3]: Akash Kumar Dhaka, Alejandro Catalina, Michael Riis Andersen, Mans Magnusson, Jonathan Huggins, Aki Vehtari. Robust, Accurate Stochastic Optimization for Variational Inference. Advances in Neural Information Processing Systems, Volume 33, pages 10961–10973, 2020.
- [Publication 4]: Akash Kumar Dhaka, Alejandro Catalina, Michael Riis Andersen, Manushi Welandawe, Jonathan Huggins, Aki Vehtari. Challenges and Opportunities in High-dimensional Variational Inference. Advances in Neural Information Processing Systems, Volume 34, 2021.