Thesis (Ph.D.) - Korea Advanced Institute of Science and Technology (KAIST) : Robotics Program, 2013.2, [ vii, 100 p. ]
Reinforcement learning; actor-critic; local model; policy gradient; function approximation