DSpace at KOASAS: State Entropy Maximization with Random Encoders for Efficient Exploration

State Entropy Maximization with Random Encoders for Efficient Exploration

DC Field	Value	Language
dc.contributor.author	Seo, Younggyo	ko
dc.contributor.author	Chen, Lili	ko
dc.contributor.author	Shin, Jinwoo	ko
dc.contributor.author	Lee, Honglak	ko
dc.contributor.author	Abbeel, Pieter	ko
dc.contributor.author	Lee, Kimin	ko
dc.date.accessioned	2022-01-14T06:56:15Z	-
dc.date.available	2022-01-14T06:56:15Z	-
dc.date.created	2021-12-02	-
dc.date.issued	2021-07	-
dc.identifier.citation	38th International Conference on Machine Learning, ICML 2021	-
dc.identifier.issn	2640-3498	-
dc.identifier.uri	http://hdl.handle.net/10203/291830	-
dc.description.abstract	Recent exploration methods have proven to be a recipe for improving sample-efficiency in deep reinforcement learning (RL). However, efficient exploration in high-dimensional observation spaces still remains a challenge. This paper presents Random Encoders for Efficient Exploration (RE3), an exploration method that utilizes state entropy as an intrinsic reward. In order to estimate state entropy in environments with high-dimensional observations, we utilize a k-nearest neighbor entropy estimator in the low-dimensional representation space of a convolutional encoder. In particular, we find that the state entropy can be estimated in a stable and compute-efficient manner by utilizing a randomly initialized encoder, which is fixed throughout training. Our experiments show that RE3 significantly improves the sample-efficiency of both model-free and model-based RL methods on locomotion and navigation tasks from DeepMind Control Suite and MiniGrid benchmarks. We also show that RE3 allows learning diverse behaviors without extrinsic rewards, effectively improving sample-efficiency in downstream tasks.	-
dc.language	English	-
dc.publisher	JMLR-JOURNAL MACHINE LEARNING RESEARCH	-
dc.title	State Entropy Maximization with Random Encoders for Efficient Exploration	-
dc.type	Conference	-
dc.identifier.wosid	000768182705055	-
dc.type.rims	CONF	-
dc.citation.publicationname	38th International Conference on Machine Learning, ICML 2021	-
dc.identifier.conferencecountry	US	-
dc.identifier.conferencelocation	Virtual	-
dc.contributor.localauthor	Shin, Jinwoo	-
dc.contributor.localauthor	Lee, Kimin	-
dc.contributor.nonIdAuthor	Chen, Lili	-
dc.contributor.nonIdAuthor	Lee, Honglak	-
dc.contributor.nonIdAuthor	Abbeel, Pieter	-
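
The abstract above describes estimating state entropy with a k-nearest neighbor estimator in the representation space of a randomly initialized encoder that stays fixed during training. The following is a minimal sketch of that idea, not the authors' implementation. Several parts are assumptions made for illustration: a random linear projection with a tanh stands in for the convolutional encoder, the reward is taken as log(1 + distance to the k-th nearest neighbor), and the names `make_random_encoder` and `knn_intrinsic_reward` are hypothetical.

```python
import numpy as np


def make_random_encoder(obs_dim, latent_dim, seed=0):
    """Return a fixed, randomly initialized encoder (never trained).

    Assumption: a random linear map plus tanh replaces the convolutional
    encoder mentioned in the abstract, to keep the sketch short.
    """
    rng = np.random.default_rng(seed)
    weights = rng.normal(size=(obs_dim, latent_dim)) / np.sqrt(obs_dim)

    def encode(obs):
        # The weights are captured once and never updated.
        return np.tanh(np.asarray(obs, dtype=np.float64) @ weights)

    return encode


def knn_intrinsic_reward(latents, k=3):
    """Per-sample reward from the distance to the k-th nearest neighbor.

    Assumption: reward = log(1 + distance), a common form for k-NN based
    state entropy bonuses. Larger values mark less-visited regions.
    """
    latents = np.asarray(latents, dtype=np.float64)
    n = latents.shape[0]
    if not 1 <= k < n:
        raise ValueError("k must satisfy 1 <= k < number of samples")
    # Pairwise Euclidean distances, shape (n, n).
    diffs = latents[:, None, :] - latents[None, :, :]
    dists = np.linalg.norm(diffs, axis=-1)
    # After sorting each row, column 0 is the zero self-distance,
    # so column k holds the distance to the k-th nearest neighbor.
    kth_dist = np.sort(dists, axis=1)[:, k]
    return np.log(kth_dist + 1.0)


if __name__ == "__main__":
    encode = make_random_encoder(obs_dim=16, latent_dim=4, seed=0)
    observations = np.random.default_rng(1).normal(size=(32, 16))
    rewards = knn_intrinsic_reward(encode(observations), k=3)
    print(rewards.shape)
```

In this sketch the pairwise distance matrix costs O(n²) memory, which is acceptable for a small batch but would need a more careful nearest-neighbor search for a large replay buffer.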

Appears in Collection: AI-Conference Papers

Files in This Item: There are no files associated with this item.


Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.