DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Sung, Youngchul | - |
dc.contributor.advisor | 성영철 | - |
dc.contributor.author | Choi, Sungho | - |
dc.date.accessioned | 2021-05-13T19:34:27Z | - |
dc.date.available | 2021-05-13T19:34:27Z | - |
dc.date.issued | 2020 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=911414&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/284784 | - |
dc.description | 학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2020.2,[ii, 25 p. :] | - |
dc.description.abstract | This study suggested a cross-domain imitation learning method where the agent in one domain follows the behavior of an expert in another domain without accessing any reward function. One common method is to extract domain-independent feature vectors from inputs and determine how well the agent is trained. However, due to the difference between two domains, the behavior of the agent cannot be compared appropriately with that of the expert regardless of its degree of training. To overcome this problem, we extracted two feature vectors from each input | - |
dc.description.abstract | one contained only domain information and the other contained only policy expertness information. The expert data in target-domain can be produced by combining the target-domain feature vector and the expert-policy feature vector. The proposed method showed improved performance in several environments with domain discrepancies. From this study, we can expect to alleviate the difficulties of collecting demonstrations and increase the efficiency of training agents using imitation learning. | - |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | reinforcement learning▼aimitation learning▼adomain adaptation▼afeature synthesis▼aadversarial training | - |
dc.subject | 강화학습▼a모방학습▼a영역적응▼a특성합성▼a적대적 학습 | - |
dc.title | Cross-domain imitation learning with feature synthesis | - |
dc.title.alternative | 특성합성을 이용한 교차영역 모방학습 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | 한국과학기술원 :전기및전자공학부, | - |
dc.contributor.alternativeauthor | 최성호 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.