SPartAN: A Meta-algorithm for Reinforcement Learning using State Partitioning and Action Network

DC Field | Value | Language
dc.contributor.author | Shin, Kyohong | ko
dc.contributor.author | Lee, Taesik | ko
dc.date.accessioned | 2019-06-10T04:50:10Z | -
dc.date.available | 2019-06-10T04:50:10Z | -
dc.date.created | 2019-06-10 | -
dc.date.issued | 2018-12-10 | -
dc.identifier.citation | WSC '18: Winter Simulation Conference, pp. 4182-4183 | -
dc.identifier.uri | http://hdl.handle.net/10203/262497 | -
dc.description.abstract | Targeting finite-horizon Markov Decision Process problems, we propose a novel approach with an aim to significantly enhance the scalability of reinforcement learning (RL) algorithms. Our approach, which we call a State Partitioning and Action Network, SPartAN in short, is a meta-algorithm that offers a framework an RL algorithm can be incorporated into. Key ideas in SPartAN are threefold: reducing the size of an original RL problem by partitioning the state space into smaller compartments, using a simulation model to directly obtain values of the terminal states of the upstream compartment, and constructing a quality heuristic policy in the downstream compartment by an action network to use in the simulation. Using temporal difference learning as an example RL algorithm, we show that SPartAN is able to reliably derive a high quality policy solution. Through empirical analysis, we also find that a smaller downstream state subspace in SPartAN yields higher performance. | -
dc.language | English | -
dc.publisher | IEEE Press | -
dc.title | SPartAN: A Meta-algorithm for Reinforcement Learning using State Partitioning and Action Network | -
dc.type | Conference | -
dc.type.rims | CONF | -
dc.citation.beginningpage | 4182 | -
dc.citation.endingpage | 4183 | -
dc.citation.publicationname | WSC '18: Winter Simulation Conference | -
dc.identifier.conferencecountry | SW | -
dc.identifier.conferencelocation | Gothia Towers | -
dc.contributor.localauthor | Lee, Taesik | -
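
The three ideas named in the abstract (partitioning the state space, simulating to obtain values at the terminal states of the upstream compartment, and using a heuristic policy downstream) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the record gives no algorithmic detail, so the toy walk MDP, the partition by decision stage (`CUT`), the slip probability, and the hand-coded `downstream_heuristic` (standing in for the action network) are all assumptions made for illustration. Tabular Q-learning is used as one example of a temporal difference method.

```python
import random

# All constants below are hypothetical; the toy problem is not from the paper.
HORIZON = 6        # total number of decision stages
CUT = 3            # stages < CUT form the upstream compartment, the rest downstream
N_POS = 5          # positions 0..4 on a line
ACTIONS = (-1, +1)
SLIP = 0.1         # probability that the chosen move is reversed


def step(x, a, rng):
    """Simulation model: move (sometimes slipping); reward is the new position."""
    if rng.random() < SLIP:
        a = -a
    x2 = min(max(x + a, 0), N_POS - 1)
    return x2, float(x2)


def downstream_heuristic(t, x):
    """Stand-in for the action network: a fixed rule that always moves right."""
    return +1


def rollout_value(t, x, rng, n_rollouts=5):
    """Estimate the value of state (t, x) by simulating the heuristic to the horizon."""
    total = 0.0
    for _ in range(n_rollouts):
        xi = x
        for ti in range(t, HORIZON):
            xi, r = step(xi, downstream_heuristic(ti, xi), rng)
            total += r
    return total / n_rollouts


def train_upstream(episodes=3000, alpha=0.1, eps=0.2, seed=0):
    """Tabular Q-learning restricted to the upstream compartment."""
    rng = random.Random(seed)
    q = {(t, x, a): 0.0
         for t in range(CUT) for x in range(N_POS) for a in ACTIONS}
    for _ in range(episodes):
        x = rng.randrange(N_POS)
        for t in range(CUT):
            if rng.random() < eps:
                a = rng.choice(ACTIONS)  # explore
            else:
                a = max(ACTIONS, key=lambda b: q[(t, x, b)])  # exploit
            x2, r = step(x, a, rng)
            if t + 1 == CUT:
                # Upstream terminal state: its value comes from simulation
                # of the downstream heuristic, not from a learned table.
                target = r + rollout_value(CUT, x2, rng)
            else:
                target = r + max(q[(t + 1, x2, b)] for b in ACTIONS)
            q[(t, x, a)] += alpha * (target - q[(t, x, a)])
            x = x2
    return q


def greedy_action(q, t, x):
    """Greedy upstream action under the learned Q-table."""
    return max(ACTIONS, key=lambda b: q[(t, x, b)])
```

In this sketch the learner only ever stores values for the `CUT * N_POS` upstream states, which is the sense in which the partition shrinks the learning problem; moving `CUT` changes how much of the horizon is handled by learning versus by the simulated heuristic.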
Appears in Collection
IE-Conference Papers