Modelling Human Decision-making based on Aggregate Observation Data

dc.contributor | Aalto-yliopisto | fi
dc.contributor | Aalto University | en
dc.contributor.author | Kangasrääsiö, Antti | en_US
dc.contributor.author | Kaski, Samuel | en_US
dc.contributor.department | Department of Computer Science | en
dc.contributor.groupauthor | Centre of Excellence in Computational Inference, COIN | en
dc.contributor.groupauthor | Professorship Kaski Samuel | en
dc.contributor.groupauthor | Helsinki Institute for Information Technology (HIIT) | en
dc.contributor.groupauthor | Probabilistic Machine Learning | en
dc.date.accessioned | 2019-02-25T08:42:39Z
dc.date.available | 2019-02-25T08:42:39Z
dc.date.issued | 2017 | en_US
dc.description.abstract | Being able to infer the goals, preferences and limitations of humans is of key importance in designing interactive systems. Reinforcement learning (RL) models are a promising direction of research, as they are able to model how the behavioural patterns of users emerge from the task and environment structure. One limitation with traditional inference methods for RL models is the strict requirements for observation data; both the states of the environment and the actions of the agent need to be observed at each step of the task. This has prevented RL models from being used in situations where such fine-grained observations are not available. In this extended abstract we present results from a recent study where we demonstrated how inference can be performed for RL models even when the observation data is significantly more coarse-grained. The idea is to solve the inverse reinforcement learning (IRL) problem using approximate Bayesian computation sped up with Bayesian optimization. | en
dc.description.version | Peer reviewed | en
dc.format.extent | 4
dc.format.mimetype | application/pdf | en_US
dc.identifier.citation | Kangasrääsiö, A & Kaski, S 2017, Modelling Human Decision-making based on Aggregate Observation Data. in Human In The Loop-ML Workshop at ICML. Human in the Loop Machine Learning, Sydney, Human in the Loop Machine Learning; ICML Workshop, Sydney, Australia, 11/08/2017. | en
dc.identifier.other | PURE UUID: 25d9924d-3936-4f4e-94ab-7164d9c5e896 | en_US
dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/25d9924d-3936-4f4e-94ab-7164d9c5e896 | en_US
dc.identifier.other | PURE LINK: https://machlearn.gitlab.io/hitl2017/ | en_US
dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/14255048/ICML17_WS.pdf | en_US
dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/36680
dc.identifier.urn | URN:NBN:fi:aalto-201902251837
dc.language.iso | en | en
dc.relation.ispartof | Human in the Loop Machine Learning; ICML Workshop | en
dc.relation.ispartofseries | Human In The Loop-ML Workshop at ICML | en
dc.rights | openAccess | en
dc.title | Modelling Human Decision-making based on Aggregate Observation Data | en
dc.type | A4 Artikkeli konferenssijulkaisussa (A4 Article in a conference publication) | fi
dc.type.version | acceptedVersion
