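Improving upper confidence reinforcement learning with bootstrapping (Korean title: Using bootstrap techniques for efficient exploration in reinforcement learning). Kim, Sanghwa; Min, Seungki; Kim, Kyoung-Kuk; et al., Korea Advanced Institute of Science and Technology (KAIST), 2022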
Thompson sampling with information relaxation penalties. Min, Seungki; Maglaras, Costis; Moallemi, Ciamac C., 33rd Annual Conference on Neural Information Processing Systems, NeurIPS 2019, Neural Information Processing Systems Foundation, 2019-12-08