User controlled Exploration and Exploitaion Search in Multi-Armed Bandits Using ToM(theory of mind)

Loading...
Thumbnail Image

URL

Journal Title

Journal ISSN

Volume Title

Perustieteiden korkeakoulu | Master's thesis

Date

2021-08-23

Department

Major/Subject

Machine Learning, Data Science and Artificial Intelligence

Mcode

SCI3044

Degree programme

Master’s Programme in Computer, Communication and Information Sciences

Language

en

Pages

18+0

Series

Abstract

The interactive Information Retrieval (IR) system rely on user relevance feedback to update the set of recommendation. However, the user intent while providing such feedback need not necessarily mean relevant/irrelevant. These kinds of feedbacks, if not accounted properly, lead to misinterpretation and biases the learning of the user intent. As users increasingly interact with AI system, they form a metal model of it. And thus they use this model to steer the AI towards their true intent by providing relevance feedback such that it would yield them the desired result. This thesis propose an system which lets users directly interact with the AI's theory of mind and there by aid in steering it towards their true intent. This is achieved by providing an interaction component along with relevance feedback using which the users can control the AI better. The thesis also discusses a metric using which a user can steer the AI towards his/her true intent. Preliminary examination of result shows that such kind of interactive component can use used in interactive Information Retrieval system to speed up the process of information retrieval.

Description

Supervisor

Kaski, Samuel

Thesis advisor

Daee, Pedram

Keywords

information retrival, multi-armed bandit, theory of mind, exploration

Other note

Citation