DSpace at KOASAS: Cross-domain imitation learning with feature synthesis

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Master(석사논문)

Cross-domain imitation learning with feature synthesis특성합성을 이용한 교차영역 모방학습

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 144
Download : 0

Export

Choi, Sungho

This study suggested a cross-domain imitation learning method where the agent in one domain follows the behavior of an expert in another domain without accessing any reward function. One common method is to extract domain-independent feature vectors from inputs and determine how well the agent is trained. However, due to the difference between two domains, the behavior of the agent cannot be compared appropriately with that of the expert regardless of its degree of training. To overcome this problem, we extracted two feature vectors from each input; one contained only domain information and the other contained only policy expertness information. The expert data in target-domain can be produced by combining the target-domain feature vector and the expert-policy feature vector. The proposed method showed improved performance in several environments with domain discrepancies. From this study, we can expect to alleviate the difficulties of collecting demonstrations and increase the efficiency of training agents using imitation learning.

Advisors: Sung, Youngchul researcher; 성영철 researcher

Description: 한국과학기술원 :전기및전자공학부,

Publisher: 한국과학기술원

Issue Date: 2020

Identifier: 325007

Language: eng

Description: 학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2020.2,[ii, 25 p. :]

Keywords: reinforcement learning▼aimitation learning▼adomain adaptation▼afeature synthesis▼aadversarial training; 강화학습▼a모방학습▼a영역적응▼a특성합성▼a적대적 학습

URI: http://hdl.handle.net/10203/284784

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=911414&flag=dissertation

Appears in Collection: EE-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Cross-domain imitation learning with feature synthesis특성합성을 이용한 교차영역 모방학습

KOASAS

Communities & Collections