DSpace at KOASAS: Constrained Bayesian Reinforcement Learning via Approximate Linear Programming

Constrained Bayesian Reinforcement Learning via Approximate Linear Programming

DC Field	Value	Language
dc.contributor.author	Lee, Jongmin	ko
dc.contributor.author	Jang, Youngsoo	ko
dc.contributor.author	Poupart, Pascal	ko
dc.contributor.author	Kim, Kee-Eung	ko
dc.date.accessioned	2017-08-16T08:49:53Z	-
dc.date.available	2017-08-16T08:49:53Z	-
dc.date.created	2017-06-21	-
dc.date.issued	2017-08-24	-
dc.identifier.citation	26th International Joint Conference on Artificial Intelligence, pp. 2088-2095	-
dc.identifier.uri	http://hdl.handle.net/10203/225309	-
dc.description.abstract	In this paper, we consider the safe learning scenario where we need to restrict the exploratory behavior of a reinforcement learning agent. Specifically, we treat the problem as a form of Bayesian reinforcement learning in an environment that is modeled as a constrained MDP (CMDP) where the cost function penalizes undesirable situations. We propose a model-based Bayesian reinforcement learning (BRL) algorithm for such an environment, eliciting risk-sensitive exploration in a principled way. Our algorithm efficiently solves the constrained BRL problem by approximate linear programming, and generates a finite state controller in an offline manner. We provide theoretical guarantees and demonstrate empirically that our approach outperforms the state of the art.	-
dc.language	English	-
dc.publisher	International Joint Conferences on Artificial Intelligence Organization (IJCAI)	-
dc.title	Constrained Bayesian Reinforcement Learning via Approximate Linear Programming	-
dc.type	Conference	-
dc.identifier.wosid	000764137502029	-
dc.identifier.scopusid	2-s2.0-85031918650	-
dc.type.rims	CONF	-
dc.citation.beginningpage	2088	-
dc.citation.endingpage	2095	-
dc.citation.publicationname	26th International Joint Conference on Artificial Intelligence	-
dc.identifier.conferencecountry	AU	-
dc.identifier.conferencelocation	Melbourne Convention and Exhibition Center	-
dc.contributor.localauthor	Kim, Kee-Eung	-
dc.contributor.nonIdAuthor	Jang, Youngsoo	-
dc.contributor.nonIdAuthor	Poupart, Pascal	-

Appears in Collection: AI-Conference Papers

Files in This Item: There are no files associated with this item.

This item is cited by 3 other documents in WoS.
