DSpace at KOASAS: MILP based value backups in partially observed Markov decision processes (POMDPs) with very large or continuous action and observation spaces

DSpace at KOASAS

College of Engineering(공과대학)Dept. of Chemical and Biomolecular Engineering(생명화학공학과)CBE-Journal Papers(저널논문)

MILP based value backups in partially observed Markov decision processes (POMDPs) with very large or continuous action and observation spaces

Cited 3 time in

Cited 3 time in

Hit : 484
Download : 0

Export

Agrawal, Rakshita / Realff, Matthew J. / Lee, JayHyung researcher

Partially observed Markov decision processes (POMDPs) serve as powerful tools to model stochastic systems with partial state information. Since the exact solution methods for POMDPs are limited to problems with very small sizes of state, action and observation spaces, approximate point-based solution methods like PERSEUS have gained popularity. In this work, a mixed integer linear program (MILP) is developed for calculation of exact value updates (in PERSEUS and similar algorithms), when the POMDP has very large or continuous action space. Since the solution time of the MILP is very sensitive to the size of the observation space, the concept of post-decision belief space is introduced to generate a more efficient and flexible model. An example involving a flow network is presented to illustrate the concepts and compare the results with those of the existing techniques. (C) 2013 Elsevier Ltd. All rights reserved.

Publisher: PERGAMON-ELSEVIER SCIENCE LTD

Issue Date: 2013-09

Language: English

Article Type: Article

Keywords: INFINITE-HORIZON; SENSOR PLACEMENT; WATER NETWORKS

Citation: COMPUTERS & CHEMICAL ENGINEERING, v.56, pp.101 - 113

ISSN: 0098-1354

DOI: 10.1016/j.compchemeng.2013.04.020

URI: http://hdl.handle.net/10203/175493

Appears in Collection: CBE-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 3 items in WoS	Click to see citing articles in

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

MILP based value backups in partially observed Markov decision processes (POMDPs) with very large or continuous action and observation spaces

This item is cited by other documents in WoS

KOASAS

Communities & Collections