Optimization for Reinforcement Learning: From a single agent to cooperative agents. Lee, Donghwan; He, Niao; Kamalaruban, Parameswaran; Cevher, Volkan. IEEE SIGNAL PROCESSING MAGAZINE, v.37, no.3, pp.123-135, 2020-05