Advances in distributed Bayesian inference and graph neural networks
dc.contributor | Aalto-yliopisto | fi |
dc.contributor | Aalto University | en |
dc.contributor.author | Mesquita, Diego | |
dc.contributor.department | Tietotekniikan laitos | fi |
dc.contributor.department | Department of Computer Science | en |
dc.contributor.lab | Probabilistic Machine Learning (PML) | en |
dc.contributor.school | Perustieteiden korkeakoulu | fi |
dc.contributor.school | School of Science | en |
dc.contributor.supervisor | Kaski, Samuel, Prof., Aalto University, Department of Computer Science, Finland | |
dc.date.accessioned | 2021-11-11T10:00:07Z | |
dc.date.available | 2021-11-11T10:00:07Z | |
dc.date.defence | 2021-11-24 | |
dc.date.issued | 2021 | |
dc.description | Defence is held on 24.11.2021 12:00 – 16:00 Zoom, https://aalto.zoom.us/j/6031768727 | |
dc.description.abstract | Bayesian statistics and graph neural networks comprise a bag of tools widely employed in machine learning and applied sciences. The former rests on solid theoretical foundations, but its application depends on techniques that scale poorly as data increase. The latter is well known for large-scale applications (e.g., in bioinformatics and natural language processing), but is largely based only on empirical intuitions. This thesis aims to i) broaden the scope of applications for Bayesian inference, and ii) deepen the understanding of core design principles of graph neural networks. First, we focus on distributed Bayesian inference under limited communication. We advance the state of the art in embarrassingly parallel Markov chain Monte Carlo (MCMC) with a novel method that leverages normalizing flows as density estimators. On the same front, we also propose an extension of stochastic gradient Langevin dynamics for federated data, which are inherently distributed in a non-IID manner and cannot be centralized due to privacy constraints. Second, we develop a methodology for meta-analysis that allows the combination of Bayesian posteriors from different studies. Our approach is agnostic to study-specific complexities, which are all encapsulated in their respective posteriors. This extends the application of Bayesian meta-analysis to likelihood-free posteriors, which would otherwise be challenging. Our method also enables us to reuse posteriors from computationally costly analyses and update them post hoc, without rerunning the analyses. Finally, we revisit two popular graph neural network components: spectral graph convolutions and pooling layers. Regarding convolutions, we propose a novel architecture and show that it is possible to achieve state-of-the-art performance by adding a minimal set of features to the most basic formulation of polynomial spectral convolutions. On the topic of pooling, we challenge the need for intricate pooling schemes and show that they do not play a role in the performance of graph networks in relevant benchmarks. | en |
dc.format.extent | 41 + app. 92 | |
dc.format.mimetype | application/pdf | en |
dc.identifier.isbn | 978-952-64-0609-1 (electronic) | |
dc.identifier.isbn | 978-952-64-0608-4 (printed) | |
dc.identifier.issn | 1799-4942 (electronic) | |
dc.identifier.issn | 1799-4934 (printed) | |
dc.identifier.issn | 1799-4934 (ISSN-L) | |
dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/110937 | |
dc.identifier.urn | URN:ISBN:978-952-64-0609-1 | |
dc.language.iso | en | en |
dc.opn | Vergari, Antonio, Prof., University of Edinburgh, UK | |
dc.publisher | Aalto University | en |
dc.publisher | Aalto-yliopisto | fi |
dc.relation.haspart | [Publication 1]: Diego Mesquita, Paul Blomstedt, and Samuel Kaski. Embarrassingly Parallel MCMC with deep invertible transformations. In Uncertainty in Artificial Intelligence, Tel-Aviv, Israel, July 2019. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-2020123160479. | |
dc.relation.haspart | [Publication 2]: Khaoula el Mekkaoui, Diego Mesquita, Paul Blomstedt, and Samuel Kaski. Federated stochastic gradient Langevin dynamics. In Uncertainty in Artificial Intelligence, Online, July 2021 | |
dc.relation.haspart | [Publication 3]: Paul Blomstedt, Diego Mesquita, Jarno Lintuusari, Tuomas Sivula, Jukka Corander, and Samuel Kaski. Meta-analysis of Bayesian analyses. Submitted to a journal, 2020 | |
dc.relation.haspart | [Publication 4]: Diego Mesquita, Amauri Souza, and Samuel Kaski. Rethinking pooling in graph neural networks. In Advances in Neural Information Processing Systems, Online, December 2020. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-202102021917. | |
dc.relation.haspart | [Publication 5]: Hojin Kang, Jou-hui Ho, Diego Mesquita, Amauri Souza, Jorge Pérez, and Samuel Kaski. Spectral Graph Networks with Constrained Polynomials. Submitted to a journal, 2021 | |
dc.relation.ispartofseries | Aalto University publication series DOCTORAL DISSERTATIONS | en |
dc.relation.ispartofseries | 166/2021 | |
dc.rev | de Campos, Cassio, Prof., Eindhoven University of Technology, Netherlands | |
dc.rev | Lamb, Luis, Prof., Federal University of Rio Grande do Sul, Brazil | |
dc.subject.keyword | Bayesian statistics | en |
dc.subject.keyword | graph neural networks | en |
dc.subject.keyword | machine learning | en |
dc.subject.other | Computer science | en |
dc.title | Advances in distributed Bayesian inference and graph neural networks | en |
dc.type | G5 Artikkeliväitöskirja | fi |
dc.type.dcmitype | text | en |
dc.type.ontasot | Doctoral dissertation (article-based) | en |
dc.type.ontasot | Väitöskirja (artikkeli) | fi |
local.aalto.acrisexportstatus | checked 2021-11-29_1536 | |
local.aalto.archive | yes | |
local.aalto.formfolder | 2021_11_10_klo_12_36 | |
local.aalto.infra | Science-IT | |
Files
Original bundle
- Name: isbn9789526406091.pdf
- Size: 893.36 KB
- Format: Adobe Portable Document Format
- Name: Thesis_errata.pdf
- Size: 58.64 KB
- Format: Adobe Portable Document Format
- Name: Thesis_errata_II.pdf
- Size: 68.46 KB
- Format: Adobe Portable Document Format