User controlled Exploration and Exploitaion Search in Multi-Armed Bandits Using ToM(theory of mind)
Loading...
URL
Journal Title
Journal ISSN
Volume Title
Perustieteiden korkeakoulu |
Master's thesis
Unless otherwise stated, all rights belong to the author. You may download, display and print this publication for Your own personal use. Commercial use is prohibited.
Authors
Date
2021-08-23
Department
Major/Subject
Machine Learning, Data Science and Artificial Intelligence
Mcode
SCI3044
Degree programme
Master’s Programme in Computer, Communication and Information Sciences
Language
en
Pages
18+0
Series
Abstract
The interactive Information Retrieval (IR) system rely on user relevance feedback to update the set of recommendation. However, the user intent while providing such feedback need not necessarily mean relevant/irrelevant. These kinds of feedbacks, if not accounted properly, lead to misinterpretation and biases the learning of the user intent. As users increasingly interact with AI system, they form a metal model of it. And thus they use this model to steer the AI towards their true intent by providing relevance feedback such that it would yield them the desired result. This thesis propose an system which lets users directly interact with the AI's theory of mind and there by aid in steering it towards their true intent. This is achieved by providing an interaction component along with relevance feedback using which the users can control the AI better. The thesis also discusses a metric using which a user can steer the AI towards his/her true intent. Preliminary examination of result shows that such kind of interactive component can use used in interactive Information Retrieval system to speed up the process of information retrieval.Description
Supervisor
Kaski, SamuelThesis advisor
Daee, PedramKeywords
information retrival, multi-armed bandit, theory of mind, exploration