Showing results 6 to 7 of 7
Off-policy multi-agent policy optimization with multi-step counterfactual advantage estimation Kim, Seongmin; Kim, Woojun; Jeon, Jeewon; Sung, Youngchul; Han, Seungyul, The 22nd International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2023, AAMAS Workshop, 2023-05 |
Robust imitation learning against variations in environment dynamics Chae, Jongseong; Han, Seungyul; Jung, Whiyoung; Cho, Myung-Sik; Choi, Sungho; Sung, Youngchul, The 39th International Conference on Machine Learning, ICML 2022, International Conference on Machine Learning, 2022-07-23 |
Discover