Domain Curiosity: Learning Efficient Data Collection Strategies for Domain Adaptation

Access rights

openAccess
acceptedVersion

A4 Article in conference proceedings

Language

en

Pages

8

Series

Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2021, pp. 1259-1266, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems

Abstract

Domain adaptation is a common problem in robotics, with applications such as transferring policies from simulation to the real world and lifelong learning. Performing such adaptation, however, requires informative data about the environment to be available during the adaptation. In this paper, we present domain curiosity, a method of training exploratory policies that are explicitly optimized to provide data that allows a model to learn about the unknown aspects of the environment. In contrast to most curiosity methods, our approach explicitly rewards learning, which makes it robust to environment noise without sacrificing its ability to learn. We evaluate the proposed method by measuring how much a model can learn about environment dynamics from data collected by the proposed approach, compared with data collected by standard curious and random policies. The evaluation is performed on a toy environment, two simulated robot setups, and a real-world haptic exploration task. The results show that the proposed method allows data-efficient and accurate estimation of dynamics.
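
The contrast the abstract draws between rewarding surprise and rewarding learning can be illustrated with a small sketch. This is not the paper's reward formulation, which the abstract does not spell out. It is a hypothetical toy in which one action returns pure noise and the other reveals a fixed unknown parameter. The names `observe`, `cumulative_rewards`, and `HIDDEN_VALUE`, the running-mean model, and the held-out error measure are all assumptions made for illustration.

```python
import random

HIDDEN_VALUE = 0.7  # hypothetical unknown dynamics parameter (assumption)


def observe(action, rng):
    """Toy environment: 'noisy' yields pure noise, anything else reveals the parameter."""
    if action == "noisy":
        return rng.gauss(0.0, 1.0)
    return HIDDEN_VALUE


def mse(estimate, samples):
    """Mean squared error of a constant prediction on a set of samples."""
    return sum((y - estimate) ** 2 for y in samples) / len(samples)


def cumulative_rewards(action, n_steps=200, n_eval=200, seed=0):
    """Return (surprise_total, learning_total) for repeating one action."""
    rng = random.Random(seed)
    held_out = [observe(action, rng) for _ in range(n_eval)]
    estimate = 0.0
    surprise_total = 0.0
    learning_total = 0.0
    for step in range(1, n_steps + 1):
        y = observe(action, rng)
        error_before = mse(estimate, held_out)
        # Prediction-error curiosity: reward how wrong the model was.
        surprise_total += abs(y - estimate)
        # Model update: running mean of the observations seen so far.
        estimate += (y - estimate) / step
        # Learning-based reward (assumed form): drop in held-out error after the update.
        learning_total += error_before - mse(estimate, held_out)
    return surprise_total, learning_total


for action in ("noisy", "informative"):
    surprise, learning = cumulative_rewards(action)
    print(f"{action:12s} surprise={surprise:8.2f} learning={learning:6.3f}")
```

In this toy, the surprise total for the noisy action keeps growing with the number of steps because noise is never predictable, whereas the learning-based total telescopes to the overall reduction in held-out error, so it stays near zero for noise and is positive only where the model actually improves.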

Citation

Arndt, K, Struckmeier, O & Kyrki, V 2021, Domain Curiosity: Learning Efficient Data Collection Strategies for Domain Adaptation. in Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2021, 9635864, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, IEEE, pp. 1259-1266, IEEE/RSJ International Conference on Intelligent Robots and Systems, Prague, Czech Republic, 27/09/2021. https://doi.org/10.1109/IROS51168.2021.9635864