Concurrent training of a control policy and a state estimator for dynamic and robust legged locomotion사족로봇의 고속보행제어를 위한 정책 및 상태추정 동시학습 기법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 267
  • Download : 0
In this paper, we propose a locomotion training framework where a control policy and a state estimator are trained concurrently. The framework consists of a policy network which outputs the desired joint positions and a state estimation network which outputs estimates of the robot’s states such as the base linear velocity, foot height, and contact probability. We exploit a fast simulation environment to train the networks and the trained networks are transferred to the real robot. The trained policy and state estimator are capable of traversing diverse terrains such as a hill, slippery plate, and bumpy road. We also demonstrate that the learned policy can run at up to 3.75 m/s on normal flat ground and 3.54 m/s on a slippery plate with the coefficient of friction of 0.22.
Advisors
Hwangbo, Jeminresearcher황보제민researcher
Description
한국과학기술원 :기계공학과,
Publisher
한국과학기술원
Issue Date
2023
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 기계공학과, 2023.2,[iii, 25 p. :]

Keywords

Legged robots▼aReinforcement learning▼aHigh-speed locomotion; 보행 로봇▼a강화 학습▼a고속 보행

URI
http://hdl.handle.net/10203/308136
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1032285&flag=dissertation
Appears in Collection
ME-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0