Partially Observable Markov Decision Process with Lagged Information over Infinite Horizon

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 352
  • Download : 0
This paper shows the infinite horizon model of Partially Observable Markov Decision Process with lagged information. The lagged information is uncertain delayed observation of the process under control. Even though the optimal policy of the model exists, finding the optimal policy is very time consuming. Thus, the aim of this study is to find an .eplison.-optimal stationary policy minimizing the expected discounted total cost of the model. .EPSILON.- optimal policy is found by using a modified version of the well known policy iteration algorithm. The modification focuses to the value determination routine of the algorithm. Some properties of the approximation functions for the expected discounted cost of a stationary policy are presented. The expected discounted cost of a stationary policy is approximated based on these properties. A numerical example is also shown.
Publisher
The Korean Operations and Management Science Society
Issue Date
1991-06
Language
English
Citation

JOURNAL OF THE KOREAN OPERATIONS RESEARCH AND MANAGEMENT SCIENCE SOCIETY, v.16, no.1, pp.135 - 146

ISSN
1225-1119
URI
http://hdl.handle.net/10203/58347
Appears in Collection
MT-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0