Showing results 1 to 5 of 5
A Discrete-Time Switching System Analysis of Q-learning Lee, Donghwan; Hu, Jianghai; He, Niao, SIAM JOURNAL ON CONTROL AND OPTIMIZATION, v.61, no.3, pp.1861 - 1880, 2023-08 |
A unified switching system perspective and convergence analysis of Q-learning algorithms Lee, Donghwan; He, Niao, 34th Conference on Neural Information Processing Systems, NeurIPS 2020, Conference on Neural Information Processing Systems, 2020-12-07 |
Optimization for Reinforcement Learning: From a single agent to cooperative agents Lee, Donghwan; He, Niao; Kamalaruban, Parameswaran; Cevher, Volkan, IEEE SIGNAL PROCESSING MAGAZINE, v.37, no.3, pp.123 - 135, 2020-05 |
Periodic Q-learning Lee, Donghwan; He, Niao, 2nd Annual Conference on Learning for Dynamics and Control(L4DC), UC Berkeley, 2020-06-11 |
Target-Based Temporal-Difference Learning Lee, Donghwan; He, Niao, 36th International Conference on Machine Learning, ICML 2019, Carnegie Mellon University, 2019-06-12 |
Discover