ROCK star - Efficient Black-box Optimization for Policy Learning

Cited 7 time in webofscience Cited 8 time in scopus
  • Hit : 191
  • Download : 0
Robotic learning on real hardware requires an efficient algorithm which minimizes the number of trials needed to learn an optimal policy. Prolonged use of hardware causes wear and tear on the system and demands more attention from an operator. To this end, we present a novel black-box optimization algorithm, Reward Optimization with Compact Kernels and fast natural gradient regression (ROCK star). Our algorithm immediately updates knowledge after a single trial and is able to extrapolate in a controlled manner. These features make fast and safe learning on real hardware possible. We have evaluated our algorithm on two simulated reaching tasks of a 50 degree-of-freedom robot arm and on a hopping task of a real articulated legged system. ROCK star outperformed current state-of-the-art algorithms in all tasks by a factor of three or more.
Publisher
IEEE
Issue Date
2014-11
Language
English
Citation

14th IEEE-RAS International Conference on Humanoid Robots (Humanoids), pp.535 - 540

ISSN
2164-0572
DOI
10.1109/HUMANOIDS.2014.7041414
URI
http://hdl.handle.net/10203/273970
Appears in Collection
ME-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 7 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0