Constrained Bayesian Reinforcement Learning via Approximate Linear Programming

Cited 3 time in webofscience Cited 0 time in scopus
  • Hit : 366
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorLee, Jongminko
dc.contributor.authorJang, Youngsooko
dc.contributor.authorPoupart, Pascalko
dc.contributor.authorKim, Kee-Eungko
dc.date.accessioned2017-08-16T08:49:53Z-
dc.date.available2017-08-16T08:49:53Z-
dc.date.created2017-06-21-
dc.date.created2017-06-21-
dc.date.created2017-06-21-
dc.date.issued2017-08-24-
dc.identifier.citation26th International Joint Conference on Artificial Intelligence, pp.2088 - 2095-
dc.identifier.urihttp://hdl.handle.net/10203/225309-
dc.description.abstractIn this paper, we consider the safe learning scenario where we need to restrict the exploratory behavior of a reinforcement learning agent. Specifically, we treat the problem as a form of Bayesian reinforcement learning in an environment that is modeled as a constrained MDP (CMDP) where the cost function penalizes undesirable situations. We propose a model-based Bayesian reinforcement learning (BRL) algorithm for such an environment, eliciting risk-sensitive exploration in a principled way. Our algorithm efficiently solves the constrained BRL problem by approximate linear programming, and generates a finite state controller in an offline manner. We provide theoretical guarantees and demonstrate empirically that our approach outperforms the state of the art.-
dc.languageEnglish-
dc.publisherInternational Joint Conferences on Artificial Intelligence Organization (IJCAI)-
dc.titleConstrained Bayesian Reinforcement Learning via Approximate Linear Programming-
dc.typeConference-
dc.identifier.wosid000764137502029-
dc.identifier.scopusid2-s2.0-85031918650-
dc.type.rimsCONF-
dc.citation.beginningpage2088-
dc.citation.endingpage2095-
dc.citation.publicationname26th International Joint Conference on Artificial Intelligence-
dc.identifier.conferencecountryAT-
dc.identifier.conferencelocationMelbourne Convention and Exhibition Center-
dc.contributor.localauthorKim, Kee-Eung-
dc.contributor.nonIdAuthorJang, Youngsoo-
dc.contributor.nonIdAuthorPoupart, Pascal-
Appears in Collection
AI-Conference Papers(학술대회논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 3 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0