Hierarchical Reinforcement Learning Explains Task Interleaving Behavior
Access rights
openAccess
A1 Original article in a scientific journal
This publication is imported from Aalto University research portal.
Date
2021-09
Language
en
Pages
21
284-304
Series
Computational Brain & Behavior, Volume 4, issue 3
Abstract
How do people decide how long to continue in a task, when to switch, and to which other task? It is known that task interleaving adapts situationally, showing sensitivity to changes in expected rewards, costs, and task boundaries. However, the mechanisms that underpin the decision to stay in a task versus switch away are not thoroughly understood. Previous work has explained task interleaving by greedy heuristics and a policy that maximizes the marginal rate of return. However, it is unclear how such a strategy would allow for adaptation to environments that offer multiple tasks with complex switch costs and delayed rewards. Here, we develop a hierarchical model of supervisory control driven by reinforcement learning (RL). The core assumption is that the supervisory level learns to switch using task-specific approximate utility estimates, which are computed on the lower level. We show that a hierarchically optimal value function decomposition can be learned from experience, even in conditions with multiple tasks and arbitrary and uncertain reward and cost structures. The model also reproduces well-known key phenomena of task interleaving, such as the sensitivity to costs of resumption and immediate as well as delayed in-task rewards. In a demanding task interleaving study with 211 human participants and realistic tasks (reading, mathematics, question-answering, recognition), the model yielded better predictions of individual-level data than a flat (non-hierarchical) RL model and an omniscient-myopic baseline. Corroborating emerging evidence from cognitive neuroscience, our results suggest hierarchical RL as a plausible model of supervisory control in task interleaving.
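The core idea of the abstract, a supervisory level that decides between tasks using utility estimates supplied by a lower level, can be illustrated with a toy sketch. This is not the authors' model. All names here (`continue_values`, `supervisor_choice`, `myopic_choice`), the reward sequences, the discount factor, and the resumption costs are hypothetical. The lower-level values are computed by backward recursion over known rewards rather than learned from experience, purely to keep the example short and deterministic.

```python
from typing import List, Optional


def continue_values(rewards: List[float], gamma: float = 0.9) -> List[float]:
    """Lower level (toy): discounted value of continuing a task from each position.

    values[k] is the discounted sum of rewards[k:]; the value past the end is 0.
    """
    values = [0.0] * (len(rewards) + 1)
    for k in range(len(rewards) - 1, -1, -1):
        values[k] = rewards[k] + gamma * values[k + 1]
    return values


def supervisor_choice(
    task_values: List[List[float]],
    positions: List[int],
    current: Optional[int],
    resume_costs: List[float],
) -> int:
    """Upper level (toy): pick the task with the highest estimated utility,
    charging a resumption cost for every task other than the current one."""
    best_task, best_utility = 0, float("-inf")
    for task, values in enumerate(task_values):
        cost = 0.0 if task == current else resume_costs[task]
        utility = values[positions[task]] - cost
        if utility > best_utility:
            best_task, best_utility = task, utility
    return best_task


def myopic_choice(
    task_rewards: List[List[float]],
    positions: List[int],
    current: Optional[int],
    resume_costs: List[float],
) -> int:
    """Baseline (toy): compare only the next immediate reward of each task."""
    best_task, best_utility = 0, float("-inf")
    for task, rewards in enumerate(task_rewards):
        pos = positions[task]
        next_reward = rewards[pos] if pos < len(rewards) else 0.0
        cost = 0.0 if task == current else resume_costs[task]
        if next_reward - cost > best_utility:
            best_task, best_utility = task, next_reward - cost
    return best_task


# Hypothetical scenario: task 0 pays steadily, task 1 pays only at the end.
task_rewards = [[1.0, 1.0, 1.0], [0.0, 0.0, 10.0]]
task_values = [continue_values(r) for r in task_rewards]
positions = [0, 0]   # progress within each task
current = 0          # the agent is currently in task 0

# The value-based supervisor switches to the delayed-reward task,
# while the myopic baseline stays where the next reward is larger.
hierarchical = supervisor_choice(task_values, positions, current, [0.5, 0.5])
myopic = myopic_choice(task_rewards, positions, current, [0.5, 0.5])
print(hierarchical, myopic)  # prints "1 0"
```

In this toy setting, raising the hypothetical resumption cost of task 1 far enough (for example to 6.0) makes the supervisor stay in task 0, which mirrors, in a very simplified way, the sensitivity to resumption costs and delayed in-task rewards that the abstract describes.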
Funding Information: Open access funding provided by Swiss Federal Institute of Technology Zurich. This work was funded in part by the Swiss National Science Foundation (UFO 200021L_153644). Publisher Copyright: © 2020, The Author(s).
Keywords
Bayesian inference, Computational modeling, Hierarchical reinforcement learning, Hierarchical reinforcement learning model for task interleaving, Task interleaving
Citation
Gebhardt, C, Oulasvirta, A & Hilliges, O 2021, 'Hierarchical Reinforcement Learning Explains Task Interleaving Behavior', Computational Brain & Behavior, vol. 4, no. 3, pp. 284-304. https://doi.org/10.1007/s42113-020-00093-9