Humanoid push recovery based on reinforcement learning in continuous state and action space연속 상태.행동공간에서 강화학습에 기반한 인간형 로봇의 넘어짐 방지

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 563
  • Download : 0
Research on biped humanoid robots is currently one of the most exciting topics in the field of robotics and among those issues, how to recover from the external push is typical one of the problems related to the robot stability. Different from the walking or other initiative motions, in push recovering, the humanoid has diffculties in taking actions in a short period when the unknown push occurs. Because of the complexity, in order to give a reference control to recover the robots from the push the more robust learning methods are needed. Moreover while imitating the human and animals, robots are preferred to learn the method in endless trials. Therefore the reinforcement learning (RL) is considered. In this thesis, a model for the push recovery is proposed as a reference trajectory. On the other hand, one of the reinforcement learning methods which is used in the continuous state and action space is introduced and applied to the push recovery with unknown push. Also, based on this RL method a with-model RL and a modified RL are proposed to realize the application of RL on the robot control. The results of the simulations are given to indicate the performance and are compared with each other to show the advantages of each method.
Advisors
Lee, Ju-Jangresearcher이주장researcher
Description
한국과학기술원 : 전기 및 전자공학과,
Publisher
한국과학기술원
Issue Date
2010
Identifier
419083/325007  / 020083972
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전기 및 전자공학과, 2010.2, [ viii, 58 p. ]

Keywords

넘어짐 방지; 인간형 로봇; 강화학습; reinforcement learning; push recovery; humanoid

URI
http://hdl.handle.net/10203/36559
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=419083&flag=dissertation
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0