DSpace at KOASAS: The Neural Information Processing Systems Foundation

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Conference Papers(학술회의논문)

The Neural Information Processing Systems Foundation

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 400
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	Lee, Su Young	ko
dc.contributor.author	Choi, Sungik	ko
dc.contributor.author	Chung, Sae-Young	ko
dc.date.accessioned	2019-12-13T07:35:01Z	-
dc.date.available	2019-12-13T07:35:01Z	-
dc.date.created	2019-11-24	-
dc.date.issued	2019-12-10	-
dc.identifier.citation	NeurIPS 2019	-
dc.identifier.uri	http://hdl.handle.net/10203/268944	-
dc.description.abstract	We propose Episodic Backward Update (EBU) – a novel deep reinforcement learning algorithm with a direct value propagation. In contrast to the conventional use of the experience replay with uniform random sampling, our agent samples a whole episode and successively propagates the value of a state to its previous states. Our computationally efficient recursive algorithm allows sparse and delayed rewards to propagate directly through all transitions of the sampled episode. We theoretically prove the convergence of the EBU method and experimentally demonstrate its performance in both deterministic and stochastic environments. Especially in 49 games of Atari 2600 domain, EBU achieves the same mean and median human normalized performance of DQN by using only 5% and 10% of samples, respectively.	-
dc.language	English	-
dc.publisher	The Neural Information Processing Systems Foundation	-
dc.title	The Neural Information Processing Systems Foundation	-
dc.type	Conference	-
dc.type.rims	CONF	-
dc.citation.publicationname	NeurIPS 2019	-
dc.identifier.conferencecountry	CN	-
dc.identifier.conferencelocation	Vancouver Convention Centre	-
dc.contributor.localauthor	Chung, Sae-Young	-
dc.contributor.nonIdAuthor	Lee, Su Young	-
dc.contributor.nonIdAuthor	Choi, Sungik	-

Appears in Collection: EE-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

The Neural Information Processing Systems Foundation

KOASAS

Communities & Collections