Palette View Synthesis - Novel View Synthesis using Diffusion Probabilistic Modelling

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorDeny, Stéphane
dc.contributor.authorSpiegl, Bernard
dc.contributor.schoolSähkötekniikan korkeakoulufi
dc.contributor.supervisorIlin, Alexander
dc.date.accessioned2023-12-18T16:56:42Z
dc.date.available2023-12-18T16:56:42Z
dc.date.issued2023-12-11
dc.description.abstractNovel view synthesis is a class of computer vision problems, in which one or multiple views of a scene or an object are provided. The goal is then to produce novel, previously unseen views of the given scene or object. Recently, the endeavors to solve such problems have gained significant traction in the generative deep learning domain. From Neural Radiance Field (NeRF) based approaches to encoder-decoder style architectures, various ways of performing novel view synthesis have been previously introduced. This work introduces Palette View Synthesis, an end-to-end diffusion probabilistic generative modelling approach for performing novel view synthesis which aims to resolve the drawbacks of previous approaches by extending the model's abilities to generalize across multiple classes, given only a single view and a target angle of the object as inputs, while simultaneously maintaining the quality of the generated samples. It shows that by employing a diffusion-based model, with a simple U-Net backbone that parameterizes the denoising function, and concatenation along the input channel dimension as a form of conditioning, it is possible to produce high quality, believable novel views while simultaneously generalizing across multiple different classes.en
dc.format.extent39+3
dc.format.mimetypeapplication/pdfen
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/124957
dc.identifier.urnURN:NBN:fi:aalto-202312187325
dc.language.isoenen
dc.locationP1fi
dc.programmeCCIS - Master’s Programme in Computer, Communication and Information Sciences (TS2013)fi
dc.programme.majorSignal Processing and Data Sciencefi
dc.programme.mcodeELEC3049fi
dc.subject.keywordnovel view synthesisen
dc.subject.keyworddiffusion probabilistic modellingen
dc.subject.keywordgenerative modellingen
dc.subject.keyworddeep learningen
dc.subject.keywordmental rotationen
dc.titlePalette View Synthesis - Novel View Synthesis using Diffusion Probabilistic Modellingen
dc.typeG2 Pro gradu, diplomityöfi
dc.type.ontasotMaster's thesisen
dc.type.ontasotDiplomityöfi
local.aalto.electroniconlyyes
local.aalto.openaccessyes

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
master_Spiegl_Bernard_2023.pdf
Size:
12.13 MB
Format:
Adobe Portable Document Format