A Meta Algorithm For Reinforcement Learning: Emergency Medical Service Resource Prioritization Problem in an MCI as an example

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 610
  • Download : 0
We present a finite-horizon Markov Decision Process (MDP) model for a patient prioritization and hospital selection problem, which is a critical decision-making problem in emergency medical service operation. Solving this model requires reinforcement learning (RL) due to its large state space. We propose a novel approach with an aim to significantly enhance the scalability of RL algorithms. Our approach, which we call a State Partitioning and Action Network, SPartAN in short, is a meta-algorithm that offers a framework an RL algorithm can be incorporated into. In this approach, we partition the state space into smaller subspaces to construct a reliable action network in the downstream subspace. This action network is then used in a simulation to approximate values of the upstream subspace. Using temporal difference (TD) learning as an example RL algorithm, we show that SPartAN is able to reliably derive a high-quality policy solution, thereby opening opportunities to solve many practical MDP models in healthcare system problems.
Publisher
International Conference on Health Care Systems Engineering
Issue Date
2019-06-01
Language
English
Citation

4th International Conference on Health Care Systems Engineering, HCSE 2019, pp.103 - 115

ISSN
2194-1009
DOI
10.1007/978-3-030-39694-7_9
URI
http://hdl.handle.net/10203/262499
Appears in Collection
IE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0