DSpace at KOASAS: Inverse discounted-based LQR algorithm for learning human movement behaviors


Inverse discounted-based LQR algorithm for learning human movement behaviors


El-Hussieny, Haitham / Ryu, Jee-Hwan

Recently, there has been increasing interest in understanding human movement behaviors. One approach is to retrieve the unknown underlying objective function that the human optimizes while performing a given movement. Existing research on behavioral understanding depends solely on predefined optimality criteria, mainly minimum time, minimum variance, and/or minimum effort. These criteria are assumed to be constant, meaning the human is assumed to have the same preferences throughout the movement. In this paper, by contrast, the optimality criteria underlying the kinematic characteristics of a given human behavior are assumed to be exponentially discounted, to account for changes in the human's preferences that may occur during the behavior. A new Inverse Discounted-based Linear Quadratic Regulator (ID-LQR) algorithm is developed within the Inverse Optimal Control (IOC) framework to find the discounted cost function that could reproduce the measured human behavior perfectly. In addition, an incremental version of the ID-LQR algorithm is proposed to continuously refine the cost function learned so far when demonstrations are presented sequentially. Saccadic eye gaze movement is studied as an example to evaluate both the proposed ID-LQR and incremental ID-LQR approaches. Simulation results are encouraging and show that the saccadic trajectories generated by the ID-LQR approach match the experimental data in many aspects, including the position and velocity profiles of saccades. Moreover, when assessed on a subsequent set of scenarios, the incremental ID-LQR algorithm confirms its capability to generalize the cost function retrieved so far to unseen saccadic demonstrations.
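
To make the idea of an exponentially discounted quadratic cost concrete, the following is a minimal sketch and not the authors' ID-LQR algorithm. It assumes a scalar discrete-time system x[t+1] = a*x[t] + b*u[t], a finite horizon, and the cost sum over t of gamma^t * (q*x[t]^2 + r*u[t]^2). The forward problem is solved with a backward Riccati recursion, and the "inverse" step is a plain grid search over the discount factor, swapped in here only for illustration. All function names and numeric values are hypothetical.

```python
def discounted_lqr_gains(a, b, q, r, gamma, horizon):
    """Feedback gains for x[t+1] = a*x[t] + b*u[t] under the discounted cost
    sum_t gamma**t * (q*x[t]**2 + r*u[t]**2), with terminal weight q.

    Uses the current-value recursion V_t(x) = p_t * x**2, where
    V_t(x) = min_u q*x**2 + r*u**2 + gamma * p_{t+1} * (a*x + b*u)**2.
    """
    p = q  # terminal value-function coefficient
    gains = [0.0] * horizon
    for t in reversed(range(horizon)):
        denom = r + gamma * b * b * p
        gains[t] = gamma * a * b * p / denom
        p = q + gamma * a * a * p - (gamma * a * b * p) ** 2 / denom
    return gains


def rollout(a, b, gains, x0):
    """Simulate the closed loop u[t] = -k[t] * x[t] and return the states."""
    xs = [x0]
    for k in gains:
        u = -k * xs[-1]
        xs.append(a * xs[-1] + b * u)
    return xs


def fit_discount(a, b, q, r, demo, candidates):
    """Pick the candidate discount whose optimal rollout is closest to the
    demonstration in squared error (grid search, illustrative only)."""
    horizon = len(demo) - 1

    def squared_error(gamma):
        gains = discounted_lqr_gains(a, b, q, r, gamma, horizon)
        sim = rollout(a, b, gains, demo[0])
        return sum((s - d) ** 2 for s, d in zip(sim, demo))

    return min(candidates, key=squared_error)


# Hypothetical toy setup: generate a "demonstration" with a known discount,
# then try to recover that discount from the trajectory alone.
A, B, Q, R = 1.0, 0.1, 1.0, 0.5
TRUE_GAMMA = 0.9
demo = rollout(A, B, discounted_lqr_gains(A, B, Q, R, TRUE_GAMMA, 30), x0=1.0)
grid = [i / 100 for i in range(50, 100)]  # candidate discounts 0.50 .. 0.99
recovered = fit_discount(A, B, Q, R, demo, grid)
print(recovered)
```

In this toy setting the weights q and r are treated as known and only the discount is fitted. The abstract describes recovering the discounted cost function itself, which is a harder problem than this sketch addresses.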

Publisher: SPRINGER

Issue Date: 2019-04

Language: English

Article Type: Article

Citation: APPLIED INTELLIGENCE, v.49, no.4, pp.1489 - 1501

ISSN: 0924-669X

DOI: 10.1007/s10489-018-1331-y

URI: http://hdl.handle.net/10203/287430

Appears in Collection: CE-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.


Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.
