Understanding Features on Evolutionary Policy Optimizations

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 111
  • Download : 0
We analyze two deep reinforcement learning algorithms, gradient-based policy optimization and evolutionary one, by a number of visualization techniques and supplement experiments. As such techniques, filter visualization and saliency map are used to examine whether meaningful features properly extracted in the two algorithms. In addition to visual analysis, some experiments are devised to enhance the validity of the analysis. We observed that an evolutionary policy optimization tends to make use of the prior knowledge and learn the prior action distribution of the policy by a powerful exploration ability, which a gradient-based algorithm cannot do easily.
Publisher
ASSOC COMPUTING MACHINERY
Issue Date
2020-04
Language
English
Citation

35th Annual ACM Symposium on Applied Computing (SAC), pp.1112 - 1118

DOI
10.1145/3341105.3373966
URI
http://hdl.handle.net/10203/288271
Appears in Collection
RIMS Conference Papers
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0