Fast adaptation of neural networks

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorIlin, Alexander
dc.contributor.authorBoney, Rinu
dc.contributor.schoolPerustieteiden korkeakoulufi
dc.contributor.supervisorKannala, Juho
dc.date.accessioned2018-04-03T13:27:04Z
dc.date.available2018-04-03T13:27:04Z
dc.date.issued2018-03-19
dc.description.abstractThe ability to learn quickly from a few samples is a vital element of intelligence. Humans can reuse past knowledge and learn incredibly quickly. Also humans are able to interact with others to effectively guide their learning process. Computer vision systems for recognizing objects automatically from pixels are becoming commonplace in production systems. These modern computer vision systems use deep neural networks to automatically learn and recognize objects from data. Oftentimes, these deep neural networks used in production require a lot of data, take a long time to learn and forget old things when learning something new. We build upon previous methods called Prototypical Networks and Model-Agnostic Meta-Learning (MAML) that enables machines to learn to recognize new objects with very little supervision from the user. We extend these methods to the semi-supervised few-shot learning scenario, where the few labeled samples are accompanied with (potentially many) unlabeled samples. Our proposed methods are able to learn better by also making use of the additional unlabeled samples. We note that in many real-world applications the adaptation performance can be significantly improved by requesting the few labels through user feedback (active adaptation). Further, our proposed methods can also adapt to new tasks without any labeled examples (unsupervised adaptation) when the new task has the same output space as the training tasks do.en
dc.format.extent46
dc.format.mimetypeapplication/pdfen
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/30540
dc.identifier.urnURN:NBN:fi:aalto-201804032004
dc.language.isoenen
dc.programmeMaster’s Programme in Computer, Communication and Information Sciencesfi
dc.programme.majorMachine Learning and Data Miningfi
dc.programme.mcodeSCI3044fi
dc.subject.keyworddeep learningen
dc.subject.keywordactive learningen
dc.subject.keywordfew-shot learningen
dc.subject.keywordmeta-learningen
dc.titleFast adaptation of neural networksen
dc.typeG2 Pro gradu, diplomityöfi
dc.type.ontasotMaster's thesisen
dc.type.ontasotDiplomityöfi
local.aalto.electroniconlyyes
local.aalto.openaccessyes

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
master_Boney_Rinu_2018.pdf
Size:
2.01 MB
Format:
Adobe Portable Document Format