Advances in distributed Bayesian inference and graph neural networks

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorMesquita, Diego
dc.contributor.departmentTietotekniikan laitosfi
dc.contributor.departmentDepartment of Computer Scienceen
dc.contributor.labProbabilistic Machine Learning (PML)en
dc.contributor.schoolPerustieteiden korkeakoulufi
dc.contributor.schoolSchool of Scienceen
dc.contributor.supervisorKaski, Samuel, Prof., Aalto University, Department of Computer Science, Finland
dc.date.accessioned2021-11-11T10:00:07Z
dc.date.available2021-11-11T10:00:07Z
dc.date.defence2021-11-24
dc.date.issued2021
dc.descriptionDefence is held on 24.11.2021 12:00 – 16:00 Zoom, https://aalto.zoom.us/j/6031768727
dc.description.abstractBayesian statistics and graph neural networks comprise a bag of tools widely employed in machine learning and applied sciences. The former rests on solid theoretical foundations, but its application depends on techniques that scale poorly as data increase. The latter is notorious for large-scale applications (e.g., in bioinformatics and natural language processing), but is largely only based on empirical intuitions. This thesis aims to i) broaden the scope of applications for Bayesian inference, and ii) deepen the understanding of core design principles of graph neural networks. First, we focus on distributed Bayesian inference under limited communication. We advance the state-of-the-art of embarrassingly parallel Markov chain Monte Carlo (MCMC) with a novel method that leverages normalizing flows as density estimators. On the same front, we also propose an extension of stochastic gradient Langevin dynamics for federated data, which are inherently distributed in a non-IID manner and cannot be centralized due to privacy constraints. Second, we develop a methodology for meta-analysis which allows the combination of Bayesian posteriors from different studies. Our approach is agnostic to study-specific complexities, which are all encapsulated in their respective posteriors. This extends the application of Bayesian meta-analysis to likelihood-free posteriors, which would otherwise be challenging. Our method also enables us to reuse posteriors from computationally costly analyses and update them post-hoc, without rerunning the analyses. Finally, we revisit two popular graph neural network components: spectral graph convolutions and pooling layers. Regarding convolutions, we propose a novel architecture and show that it is possible to achieve state-of-the-art performance by adding a minimal set of features to the most basic formulation of polynomial spectral convolutions. On the topic of pooling, we challenge the need for intricate pooling schemes and show that they do not play a role in the performance of graph networks in relevant benchmarks.en
dc.format.extent41 + app. 92
dc.format.mimetypeapplication/pdfen
dc.identifier.isbn978-952-64-0609-1 (electronic)
dc.identifier.isbn978-952-64-0608-4 (printed)
dc.identifier.issn1799-4942 (electronic)
dc.identifier.issn1799-4934 (printed)
dc.identifier.issn1799-4934 (ISSN-L)
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/110937
dc.identifier.urnURN:ISBN:978-952-64-0609-1
dc.language.isoenen
dc.opnVergari, Antonio, Prof., University of Edinburgh, UK
dc.publisherAalto Universityen
dc.publisherAalto-yliopistofi
dc.relation.haspart[Publication 1]: Diego Mesquita, Paul Blomstedt, and Samuel Kaski. Embarrassingly Parallel MCMC with deep invertible transformations. In Uncertainty in Artificial Intelligence, Tel-Aviv, Israel, July 2019. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-2020123160479.
dc.relation.haspart[Publication 2]: Khaoula el Mekkaoui, Diego Mesquita, Paul Blomstedt, and Samuel Kaski. Federated stochastic gradient Langevin dynamics. In Uncertainty in Artificial Intelligence, Online, July 2021
dc.relation.haspart[Publication 3]: Paul Blomstedt, Diego Mesquita, Jarno Lintuusari, Tuomas Sivula, Jukka Corander, and Samuel kaski. Meta-analysis of Bayesian analyses. Submitted to a journal, 2020
dc.relation.haspart[Publication 4]: Diego Mesquita, Amauri Souza, and Samuel Kaski. Rethinking pooling in graph neural networks. In Advances in neural information processing systems, Online, December 2020. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-202102021917.
dc.relation.haspart[Publication 5]: Hojin Kang, Jou-hui Ho, Diego Mesquita, Amauri Souza, Jorge Pérez, and Samuel Kaski. Spectral Graph Networks with Constrained Polynomials. Submitted to a journal, 2021
dc.relation.ispartofseriesAalto University publication series DOCTORAL DISSERTATIONSen
dc.relation.ispartofseries166/2021
dc.revde Campos, Cassio, Prof., Eindhoven University of Technology, Netherlands
dc.revLamb, Luis, Prof., Federal University of Rio Grande do Sul, Brazil
dc.subject.keywordBayesian statisticsen
dc.subject.keywordgraph neural networksen
dc.subject.keywordmachine learningen
dc.subject.otherComputer scienceen
dc.titleAdvances in distributed Bayesian inference and graph neural networksen
dc.typeG5 Artikkeliväitöskirjafi
dc.type.dcmitypetexten
dc.type.ontasotDoctoral dissertation (article-based)en
dc.type.ontasotVäitöskirja (artikkeli)fi
local.aalto.acrisexportstatuschecked 2021-11-29_1536
local.aalto.archiveyes
local.aalto.formfolder2021_11_10_klo_12_36
local.aalto.infraScience-IT

Files

Original bundle

Now showing 1 - 3 of 3
No Thumbnail Available
Name:
isbn9789526406091.pdf
Size:
893.36 KB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
Thesis_errata.pdf
Size:
58.64 KB
Format:
Adobe Portable Document Format
Description:
No Thumbnail Available
Name:
Thesis_errata_II.pdf
Size:
68.46 KB
Format:
Adobe Portable Document Format
Description: