DSpace at KOASAS: Reinforcement Learning for Control with Multiple Frequencies

DSpace at KOASAS

RIMS Collection RIMS Conference Papers

Reinforcement Learning for Control with Multiple Frequencies

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 359
Download : 0

Export

Lee, Jongmin / Lee, Byung-Jun / Kim, Kee-Eung researcher

Many real-world sequential decision problems involve multiple action variables whose control frequencies are different, such that actions take their effects at different periods. While these problems can be formulated with the notion of multiple action persistences in factored-action MDP (FA-MDP), it is non-trivial to solve them efficiently since an action-persistent policy constructed from a stationary policy can be arbitrarily suboptimal, rendering solution methods for the standard FA-MDPs hardly applicable. In this paper, we formalize the problem of multiple control frequencies in RL and provide its efficient solution method. Our proposed method, Action-Persistent Policy Iteration (AP-PI), provides a theoretical guarantee on the convergence to an optimal solution while incurring only a factor of |A| increase in time complexity during policy improvement step, compared to the standard policy iteration for FA-MDPs. Extending this result, we present Action-Persistent Actor-Critic (AP-AC), a scalable RL algorithm for high-dimensional control tasks. In the experiments, we demonstrate that AP-AC significantly outperforms the baselines on several continuous control tasks and a traffic control simulation, which highlights the effectiveness of our method that directly optimizes the periodic non-stationary policy for tasks with multiple control frequencies.

Publisher: Neural information processing systems foundation

Issue Date: 2020-12-10

Language: English

Citation: Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020)

URI: http://hdl.handle.net/10203/278161

Appears in Collection: RIMS Conference Papers

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Reinforcement Learning for Control with Multiple Frequencies

KOASAS

Communities & Collections