Data-efficient Reinforcement Learning for Variable Impedance Control

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorAnand, Akhil S.en_US
dc.contributor.authorKaushik, Riturajen_US
dc.contributor.authorGravdahl, Jan Tommyen_US
dc.contributor.authorAbu-Dakka, Fares J.en_US
dc.contributor.departmentDepartment of Electrical Engineering and Automationen
dc.contributor.organizationNorwegian University of Science and Technologyen_US
dc.date.accessioned2024-02-07T08:15:30Z
dc.date.available2024-02-07T08:15:30Z
dc.date.issued2024en_US
dc.descriptionPublisher Copyright: Authors
dc.description.abstractOne of the most crucial steps toward achieving human-like manipulation skills in robots is to incorporate compliance into the robot controller. Compliance not only makes the robot’s behaviour safe but also makes it more energy efficient. In this direction, the variable impedance control (VIC) approach provides a framework for a robot to adapt its compliance during execution by employing an adaptive impedance law. Nevertheless, autonomously adapting the compliance profile as demanded by the task remains a challenging problem to be solved in practice. In this work, we introduce a reinforcement learning (RL)-based approach called DEVILC (Data-Efficient Variable Impedance Learning Controller) to learn the variable impedance controller through real-world interaction of the robot. More concretely, we use a model-based RL approach in which, after every interaction, the robot iteratively learns a probabilistic model of its dynamics using the Gaussian process regression model. The model is then used to optimize a neural-network policy that modulates the robot’s impedance such that the long-term reward for the task is maximized. Thanks to the model-based RL framework, DEVILC allows a robot to learn the VIC policy with only a few interactions, making it practical for real-world applications. In simulations and experiments, we evaluate DEVILC on a Franka Emika Panda robotic manipulator for different manipulation tasks in the Cartesian space. The results show that DEVILC is a promising direction toward autonomously learning compliant manipulation skills directly in the real world through interactions. A video of the experiments is available in the link: https://youtu.be/_uyr0Vye5noen
dc.description.versionPeer revieweden
dc.format.extent11
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationAnand, A S, Kaushik, R, Gravdahl, J T & Abu-Dakka, F J 2024, 'Data-efficient Reinforcement Learning for Variable Impedance Control', IEEE Access, vol. 12, pp. 15631-15641. https://doi.org/10.1109/ACCESS.2024.3355311en
dc.identifier.doi10.1109/ACCESS.2024.3355311en_US
dc.identifier.issn2169-3536
dc.identifier.otherPURE UUID: 1378c5fa-26f0-4ede-93d4-bbb29f52f0fden_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/1378c5fa-26f0-4ede-93d4-bbb29f52f0fden_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/135879422/Data-Efficient_Reinforcement_Learning_for_Variable_Impedance_Control.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/126678
dc.identifier.urnURN:NBN:fi:aalto-202402072337
dc.language.isoenen
dc.publisherIEEE
dc.relation.ispartofseriesIEEE Accessen
dc.relation.ispartofseriesVolume 12, pp. 15631-15641en
dc.rightsopenAccessen
dc.subject.keywordAdaptation modelsen_US
dc.subject.keywordAerospace electronicsen_US
dc.subject.keywordCovariance matrix adaptationen_US
dc.subject.keywordGaussian processesen_US
dc.subject.keywordImpedanceen_US
dc.subject.keywordJacobian matricesen_US
dc.subject.keywordModel-based reinforcement learningen_US
dc.subject.keywordReinforcement learningen_US
dc.subject.keywordRobotsen_US
dc.subject.keywordTask analysisen_US
dc.subject.keywordVariable impedance learning controlen_US
dc.titleData-efficient Reinforcement Learning for Variable Impedance Controlen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Data-Efficient_Reinforcement_Learning_for_Variable_Impedance_Control.pdf
Size:
1.49 MB
Format:
Adobe Portable Document Format