DSpace at KOASAS: A model-based deep reinforcement learning method applied to finite-horizon optimal control of nonlinear control-affine system


A model-based deep reinforcement learning method applied to finite-horizon optimal control of nonlinear control-affine system


Kim, Jong Woo / Park, Byung Jun / Yoo, Haeun / Oh, Tae Hoon / Lee, Jay H. / Lee, Jong Min

The Hamilton-Jacobi-Bellman (HJB) equation can be solved to obtain optimal closed-loop control policies for general nonlinear systems. Because the HJB equation can seldom be solved exactly for nonlinear systems, either analytically or numerically, methods that build approximate solutions through simulation-based learning have been studied under various names, such as neurodynamic programming (NDP) and approximate dynamic programming (ADP). The learning aspect connects these methods to reinforcement learning (RL), which also seeks optimal decision policies through trial and error. This study develops a model-based RL method that iteratively learns the solution to the HJB equation and its associated equations. We focus on control-affine systems with a quadratic objective function and on the finite-horizon optimal control (FHOC) problem with time-varying reference trajectories. The HJB solutions for such systems involve time-varying value, costate, and policy functions subject to boundary conditions. To represent the time-varying HJB solution in a high-dimensional state space in a general and efficient way, deep neural networks (DNNs) are employed. It is shown that the use of DNNs, compared to shallow neural networks (SNNs), can significantly improve the performance of the learned policy in the presence of an uncertain initial state and state noise. Examples involving a batch chemical reactor and a one-dimensional diffusion-convection-reaction system demonstrate this and other key aspects of the method.
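The abstract does not state its equations, so the following is only a minimal sketch of the standard textbook relation between the costate and the policy for a control-affine system with a quadratic objective. It is not taken from the paper. It assumes dynamics dx/dt = f(x) + g(x)u with a single scalar input u, a diagonal tracking weight q, and an input penalty r*u^2. Under those assumptions, minimizing the Hamiltonian over u gives the closed form u* = -(g . costate) / (2r). All function names and numbers below are hypothetical.

```python
def optimal_input(costate, g, r):
    """Closed-form minimizer of the Hamiltonian over a scalar input u.

    Assumes an input penalty of r * u**2 and control-affine dynamics,
    so the Hamiltonian is a convex quadratic in u (r > 0).
    """
    return -sum(gi * li for gi, li in zip(g, costate)) / (2.0 * r)


def hamiltonian(u, x, ref, q, r, f, g, costate):
    """Stage cost plus costate times dynamics, evaluated at one state.

    Assumed stage cost: sum_i q_i * (x_i - ref_i)**2 + r * u**2.
    Assumed dynamics:   dx_i/dt = f_i + g_i * u.
    """
    tracking = sum(qi * (xi - ri) ** 2 for qi, xi, ri in zip(q, x, ref))
    drift = sum(li * (fi + gi * u) for li, fi, gi in zip(costate, f, g))
    return tracking + r * u ** 2 + drift


# Hypothetical values at a single state and time instant.
x = [1.0, 0.5]          # current state
ref = [0.8, 0.0]        # reference trajectory value at this instant
q = [1.0, 2.0]          # diagonal tracking weights
r = 0.5                 # input weight
f = [-0.3, 0.2]         # drift term f(x), already evaluated
g = [1.0, -0.5]         # input gain g(x), already evaluated
costate = [0.4, -1.2]   # gradient of the value function w.r.t. x

u_star = optimal_input(costate, g, r)
h_star = hamiltonian(u_star, x, ref, q, r, f, g, costate)
```

In a learning scheme of the kind the abstract describes, the costate would presumably come from a function approximator such as a neural network rather than being given. The closed form above only illustrates why knowing the costate is enough to recover the policy in the control-affine, quadratic-cost setting.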

Publisher: ELSEVIER SCI LTD

Issue Date: 2020-03

Language: English

Article Type: Article

Citation: JOURNAL OF PROCESS CONTROL, v.87, pp.166 - 178

ISSN: 0959-1524

DOI: 10.1016/j.jprocont.2020.02.003

URI: http://hdl.handle.net/10203/273800

Appears in Collection: CBE-Journal Papers(저널논문)


