Meta distillation for reinforcement learning강화 학습을 위한 메타 디스틸레이션

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 222
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorHwang, Sung Ju-
dc.contributor.advisor황성주-
dc.contributor.authorNam, Taewook-
dc.date.accessioned2021-05-13T19:38:14Z-
dc.date.available2021-05-13T19:38:14Z-
dc.date.issued2020-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=925156&flag=dissertationen_US
dc.identifier.urihttp://hdl.handle.net/10203/284995-
dc.description학위논문(석사) - 한국과학기술원 : 전산학부, 2020.8,[ii, 15 p. :]-
dc.description.abstractAs active research of deep reinforcement learning makes it possible to apply reinforcement learning to many high-dimensional environments, the sample efficiency of reinforcement learning has been more important. Learning strategy that utilizes background knowledge from previous tasks to new tasks, such as transfer learning and meta-learning, is one common approach for enhancing sample efficiency. In this work, we propose a meta-learning framework Meta-Distillation for Reinforcement Learning (MDRL) that efficiently transfers expert policies from previous environments to a new policy in an unseen environment. A weighted sum of discrepancies between current policy and expert policies is added to policy update loss, and the weights are determined by a weight network that is meta-trained to help training by considering tasks, training sample, and policy training progress. MDRL succeed to data-efficiently adapt new task when given distribution of environment is scarce and diverse.-
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectReinforcement Learning▼aMeta-Learning▼aMeta-RL▼aTransfer Learning▼aDistillation-
dc.subject강화 학습▼a메타 학습▼a메타 강화 학습▼a전이 학습▼a디스틸레이션-
dc.titleMeta distillation for reinforcement learning-
dc.title.alternative강화 학습을 위한 메타 디스틸레이션-
dc.typeThesis(Master)-
dc.identifier.CNRN325007-
dc.description.department한국과학기술원 :전산학부,-
dc.contributor.alternativeauthor남태욱-
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0