DSpace at KOASAS: An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Conference Papers(학술회의논문)

An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search

Cited 0 time in webofscience

Cited 0 time in

Hit : 195
Download : 0

Export

Lee, Kyunghyun / Lee, ByeongUk / Shin, Ukcheol / Kweon, In-So researcher

Deep reinforcement learning (DRL) algorithms and evolution strategies (ES) have been applied to various tasks, showing excellent performances. These have the opposite properties, with DRL having good sample efficiency and poor stability, while ES being vice versa. Recently, there have been attempts to combine these algorithms, but these methods fully rely on synchronous update scheme, making it not ideal to maximize the benefits of the parallelism in ES. To solve this challenge, asynchronous update scheme was introduced, which is capable of good time-efficiency and diverse policy exploration. In this paper, we introduce an Asynchronous Evolution Strategy-Reinforcement Learning (AES-RL) that maximizes the parallel efficiency of ES and integrates it with policy gradient methods. Specifically, we propose 1) a novel framework to merge ES and DRL asynchronously and 2) various asynchronous update methods that can take all advantages of asynchronism, ES, and DRL, which are exploration and time efficiency, stability, and sample efficiency, respectively. The proposed framework and update methods are evaluated in continuous control benchmark work, showing superior performance as well as time efficiency compared to the previous methods.

Publisher: Conference on Neural Information Processing Systems

Issue Date: 2020-12-07

Language: English

Citation: 34th Conference on Neural Information Processing Systems, NeurIPS 2020

URI: http://hdl.handle.net/10203/278551

Appears in Collection: EE-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search

KOASAS

Communities & Collections