DROPO: Sim-to-real transfer with offline domain randomization

Loading...
Thumbnail Image
Access rights
openAccess
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
Date
2023-08
Major/Subject
Mcode
Degree programme
Language
en
Pages
15
Series
Robotics and Autonomous Systems, Volume 166
Abstract
In recent years, domain randomization over dynamics parameters has gained a lot of traction as a method for sim-to-real transfer of reinforcement learning policies in robotic manipulation; however, finding optimal randomization distributions can be difficult. In this paper, we introduce DROPO, a novel method for estimating domain randomization distributions for safe sim-to-real transfer. Unlike prior work, DROPO only requires a limited, precollected offline dataset of trajectories, and explicitly models parameter uncertainty to match real data using a likelihood-based approach. We demonstrate that DROPO is capable of recovering dynamic parameter distributions in simulation and finding a distribution capable of compensating for an unmodeled phenomenon. We also evaluate the method in two zero-shot sim-to-real transfer scenarios, showing successful domain transfer and improved performance over prior methods.
Description
Funding Information: This work was supported by Academy of Finland grants 317020 and 328399 . We acknowledge the computational resources generously provided by CSC – IT Center for Science, Finland, and by the Aalto Science-IT project. Publisher Copyright: © 2023 The Author(s)
Keywords
Domain randomization, Reinforcement learning, Robot learning, Transfer learning
Other note
Citation
Tiboni, G, Arndt, K & Kyrki, V 2023, ' DROPO: Sim-to-real transfer with offline domain randomization ', Robotics and Autonomous Systems, vol. 166, 104432 . https://doi.org/10.1016/j.robot.2023.104432