DSpace at KOASAS: Imitating and Finetuning Model Predictive Control for Robust and Symmetric Quadrupedal Locomotion


Imitating and Finetuning Model Predictive Control for Robust and Symmetric Quadrupedal Locomotion


DC Field	Value	Language
dc.contributor.author	Youm, Donghoon	ko
dc.contributor.author	Jung, Hyunyoung	ko
dc.contributor.author	Kim, Hyeongjun	ko
dc.contributor.author	Hwangbo, Jemin	ko
dc.contributor.author	Ha, Sehoon	ko
dc.contributor.author	Park, Hae-Won	ko
dc.date.accessioned	2023-12-08T09:00:31Z	-
dc.date.available	2023-12-08T09:00:31Z	-
dc.date.created	2023-12-08	-
dc.date.issued	2023-11	-
dc.identifier.citation	IEEE Robotics and Automation Letters, v.8, no.11, pp.7799 - 7806	-
dc.identifier.issn	2377-3766	-
dc.identifier.uri	http://hdl.handle.net/10203/316089	-
dc.description.abstract	Control of legged robots is a challenging problem that has been investigated by different approaches, such as model-based control and learning algorithms. This work proposes a novel Imitating and Finetuning Model Predictive Control (IFM) framework to take the strengths of both approaches. Our framework first develops a conventional model predictive controller (MPC) using Differential Dynamic Programming and Raibert heuristic, which serves as an expert policy. Then we train a clone of the MPC using imitation learning to make the controller learnable. Finally, we leverage deep reinforcement learning with limited exploration for further finetuning the policy on more challenging terrains. By conducting comprehensive simulation and hardware experiments, we demonstrate that the proposed IFM framework can significantly improve the performance of the given MPC controller on rough, slippery, and conveyor terrains that require careful coordination of footsteps. We also showcase that IFM can efficiently produce more symmetric, periodic, and energy-efficient gaits compared to Vanilla RL with a minimal burden of reward shaping. © 2016 IEEE.	-
dc.language	English	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Imitating and Finetuning Model Predictive Control for Robust and Symmetric Quadrupedal Locomotion	-
dc.type	Article	-
dc.identifier.wosid	001142485900002	-
dc.identifier.scopusid	2-s2.0-85173012210	-
dc.type.rims	ART	-
dc.citation.volume	8	-
dc.citation.issue	11	-
dc.citation.beginningpage	7799	-
dc.citation.endingpage	7806	-
dc.citation.publicationname	IEEE Robotics and Automation Letters	-
dc.identifier.doi	10.1109/LRA.2023.3320827	-
dc.contributor.localauthor	Hwangbo, Jemin	-
dc.contributor.localauthor	Park, Hae-Won	-
dc.contributor.nonIdAuthor	Jung, Hyunyoung	-
dc.contributor.nonIdAuthor	Kim, Hyeongjun	-
dc.contributor.nonIdAuthor	Ha, Sehoon	-
dc.description.isOpenAccess	N	-
dc.type.journalArticle	Article	-
dc.subject.keywordAuthor	imitation learning	-
dc.subject.keywordAuthor	Legged robots	-
dc.subject.keywordAuthor	reinforcement learning	-

Appears in Collection: ME-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

This item is cited by 1 other document in WoS.


KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.
