Palette View Synthesis - Novel View Synthesis using Diffusion Probabilistic Modelling
dc.contributor | Aalto-yliopisto | fi |
dc.contributor | Aalto University | en |
dc.contributor.advisor | Deny, Stéphane | |
dc.contributor.author | Spiegl, Bernard | |
dc.contributor.school | Sähkötekniikan korkeakoulu | fi |
dc.contributor.supervisor | Ilin, Alexander | |
dc.date.accessioned | 2023-12-18T16:56:42Z | |
dc.date.available | 2023-12-18T16:56:42Z | |
dc.date.issued | 2023-12-11 | |
dc.description.abstract | Novel view synthesis is a class of computer vision problems, in which one or multiple views of a scene or an object are provided. The goal is then to produce novel, previously unseen views of the given scene or object. Recently, the endeavors to solve such problems have gained significant traction in the generative deep learning domain. From Neural Radiance Field (NeRF) based approaches to encoder-decoder style architectures, various ways of performing novel view synthesis have been previously introduced. This work introduces Palette View Synthesis, an end-to-end diffusion probabilistic generative modelling approach for performing novel view synthesis which aims to resolve the drawbacks of previous approaches by extending the model's abilities to generalize across multiple classes, given only a single view and a target angle of the object as inputs, while simultaneously maintaining the quality of the generated samples. It shows that by employing a diffusion-based model, with a simple U-Net backbone that parameterizes the denoising function, and concatenation along the input channel dimension as a form of conditioning, it is possible to produce high quality, believable novel views while simultaneously generalizing across multiple different classes. | en |
dc.format.extent | 39+3 | |
dc.format.mimetype | application/pdf | en |
dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/124957 | |
dc.identifier.urn | URN:NBN:fi:aalto-202312187325 | |
dc.language.iso | en | en |
dc.location | P1 | fi |
dc.programme | CCIS - Master’s Programme in Computer, Communication and Information Sciences (TS2013) | fi |
dc.programme.major | Signal Processing and Data Science | fi |
dc.programme.mcode | ELEC3049 | fi |
dc.subject.keyword | novel view synthesis | en |
dc.subject.keyword | diffusion probabilistic modelling | en |
dc.subject.keyword | generative modelling | en |
dc.subject.keyword | deep learning | en |
dc.subject.keyword | mental rotation | en |
dc.title | Palette View Synthesis - Novel View Synthesis using Diffusion Probabilistic Modelling | en |
dc.type | G2 Pro gradu, diplomityö | fi |
dc.type.ontasot | Master's thesis | en |
dc.type.ontasot | Diplomityö | fi |
local.aalto.electroniconly | yes | |
local.aalto.openaccess | yes |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- master_Spiegl_Bernard_2023.pdf
- Size:
- 12.13 MB
- Format:
- Adobe Portable Document Format