Theory of Mind Based Models in Human-AI Interaction

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorPeltola, Tomi
dc.contributor.authorÇelikok, Mustafa
dc.contributor.schoolPerustieteiden korkeakoulufi
dc.contributor.supervisorKaski, Samuel
dc.date.accessioned2018-12-14T16:08:25Z
dc.date.available2018-12-14T16:08:25Z
dc.date.issued2018-12-10
dc.description.abstractHumans are social animals. They have goals, they make plans, they collaborate and compete. The richness of human-human interaction is immense. Yet, the way modern AI systems model their interaction with human users does not take these aspects into account. Often times human feedback is modelled as samples from an unknown but fixed probability distribution. These models are not able to capture the active planning aspect of real humans. The underlying motivation of this thesis is that the performance of human-AI collaboration is limited by the parties' ability of modelling each others' minds. In human-human interaction, this ability is called the theory of mind, and it is shown to be a limiting factor in human teams' task performance by cognitive science studies. In order to examine the effects of having theory of mind based user models, we define a multi-armed bandit setting where the system takes into account that the user is able to anticipate the system's behaviour multiple steps ahead, and strategically plan her feedback. We compare the performance of our proposed setting to the standard multi-armed bandit setting where the feedback is assumed to be samples from an unknown probability distribution. Empirical results demonstrate that better reward performance and ranking of arms are achieved when users can behave strategically and the system takes this into account. The results indicate that the performance of human-AI teams increase based on how well the parties can model each other and use their models to plan their interaction.en
dc.format.extent49
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/35517
dc.identifier.urnURN:NBN:fi:aalto-201812146533
dc.language.isoenen
dc.programmeMaster’s Programme in Computer, Communication and Information Sciencesfi
dc.programme.majorMachine Learning, Data Science and Artificial Intelligencefi
dc.programme.mcodeSCI3044fi
dc.subject.keywordBayesian modellingen
dc.subject.keywordhuman-AI collaborationen
dc.subject.keywordinteractive systemsen
dc.subject.keywordinverse reinforcement learningen
dc.subject.keywordmulti-armed banditsen
dc.subject.keywordtheory of minden
dc.titleTheory of Mind Based Models in Human-AI Interactionen
dc.typeG2 Pro gradu, diplomityöfi
dc.type.ontasotMaster's thesisen
dc.type.ontasotDiplomityöfi
local.aalto.electroniconlyyes
local.aalto.openaccessno

Files