Differentially deep Gaussian processes
dc.contributor | Aalto-yliopisto | fi |
dc.contributor | Aalto University | en |
dc.contributor.advisor | Heinonen, Markus | |
dc.contributor.author | Hegde, Pashupati | |
dc.contributor.school | Perustieteiden korkeakoulu | fi |
dc.contributor.supervisor | Kaski, Samuel | |
dc.date.accessioned | 2019-02-03T16:04:50Z | |
dc.date.available | 2019-02-03T16:04:50Z | |
dc.date.issued | 2019-01-28 | |
dc.description.abstract | Many modern machine learning methods, including deep neural networks, utilize a discrete sequence of parametric transformations to learn complex functions. Neural network based approaches can be an attractive choice for many real-world problems especially because of their modular nature. Gaussian process based methods, on the other hand, pose function approximation as a probabilistic inference problem by specifying prior distributions on unknown functions. Further, these probabilistic non-linear models provide well-calibrated uncertainty estimates which can be useful in many applications. However, the flexibility of these models depends on the choice of the kernel; handcrafting problem-specific kernels can be difficult in practice. Recently, deep Gaussian processes, a way of stacking multiple layers of Gaussian processes, was proposed as a flexible way of expanding model capacity. In this thesis, we propose a novel probabilistic deep learning approach by formulating stochastic differential transformations or `flows' of inputs using Gaussian processes. This provides continuous-time `flows' as an alternative to the traditional approach of a discrete sequence of transformations using `layers'. Moreover, the proposed approach can also be seen as an approximation to very deep Gaussian processes with infinitesimal increments across layers. We also derive a scalable inference method based on variational sparse approximations for Gaussian processes. The proposed model shows excellent results on various experiments on real-world datasets, as compared to the other popular probabilistic approaches including deep Gaussian processes and Bayesian neural networks. | en |
dc.format.extent | 47 | |
dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/36367 | |
dc.identifier.urn | URN:NBN:fi:aalto-201902031536 | |
dc.language.iso | en | en |
dc.programme | Master’s Programme in Computer, Communication and Information Sciences | fi |
dc.programme.major | Machine Learning and Data Mining | fi |
dc.programme.mcode | SCI3044 | fi |
dc.subject.keyword | Bayesian deep learning | en |
dc.subject.keyword | Bayesian nonparametrics | en |
dc.subject.keyword | Gaussian processes | en |
dc.subject.keyword | stochastic methods | en |
dc.title | Differentially deep Gaussian processes | en |
dc.type | G2 Pro gradu, diplomityö | fi |
dc.type.ontasot | Master's thesis | en |
dc.type.ontasot | Diplomityö | fi |
local.aalto.electroniconly | yes | |
local.aalto.openaccess | no |