Imitating and Finetuning Model Predictive Control for Robust and Symmetric Quadrupedal Locomotion

Cited 1 time in webofscience Cited 0 time in scopus
  • Hit : 96
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorYoum, Donghoonko
dc.contributor.authorJung, Hyunyoungko
dc.contributor.authorKim, Hyeongjunko
dc.contributor.authorHwangbo, Jeminko
dc.contributor.authorHa, Sehoonko
dc.contributor.authorPark, Hae-Wonko
dc.date.accessioned2023-12-08T09:00:31Z-
dc.date.available2023-12-08T09:00:31Z-
dc.date.created2023-12-08-
dc.date.created2023-12-08-
dc.date.created2023-12-08-
dc.date.issued2023-11-
dc.identifier.citationIEEE Robotics and Automation Letters, v.8, no.11, pp.7799 - 7806-
dc.identifier.issn2377-3766-
dc.identifier.urihttp://hdl.handle.net/10203/316089-
dc.description.abstractControl of legged robots is a challenging problem that has been investigated by different approaches, such as model-based control and learning algorithms. This work proposes a novel Imitating and Finetuning Model Predictive Control (IFM) framework to take the strengths of both approaches. Our framework first develops a conventional model predictive controller (MPC) using Differential Dynamic Programming and Raibert heuristic, which serves as an expert policy. Then we train a clone of the MPC using imitation learning to make the controller learnable. Finally, we leverage deep reinforcement learning with limited exploration for further finetuning the policy on more challenging terrains. By conducting comprehensive simulation and hardware experiments, we demonstrate that the proposed IFM framework can significantly improve the performance of the given MPC controller on rough, slippery, and conveyor terrains that require careful coordination of footsteps. We also showcase that IFM can efficiently produce more symmetric, periodic, and energy-efficient gaits compared to Vanilla RL with a minimal burden of reward shaping. © 2016 IEEE.-
dc.languageEnglish-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleImitating and Finetuning Model Predictive Control for Robust and Symmetric Quadrupedal Locomotion-
dc.typeArticle-
dc.identifier.wosid001142485900002-
dc.identifier.scopusid2-s2.0-85173012210-
dc.type.rimsART-
dc.citation.volume8-
dc.citation.issue11-
dc.citation.beginningpage7799-
dc.citation.endingpage7806-
dc.citation.publicationnameIEEE Robotics and Automation Letters-
dc.identifier.doi10.1109/LRA.2023.3320827-
dc.contributor.localauthorHwangbo, Jemin-
dc.contributor.localauthorPark, Hae-Won-
dc.contributor.nonIdAuthorJung, Hyunyoung-
dc.contributor.nonIdAuthorKim, Hyeongjun-
dc.contributor.nonIdAuthorHa, Sehoon-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorimitation learning-
dc.subject.keywordAuthorLegged robots-
dc.subject.keywordAuthorreinforcement learning-
Appears in Collection
ME-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 1 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0