Showing results 1 to 2 of 2
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models Fan, Ying; Olivia Watkins; Yuqing Du; Hao Liu; Moonkyung Ryu; Craig Boutilier; Pieter Abbeel; et al, 37th Conference on Neural Information Processing Systems (NeurIPS), Neural Information Processing Systems Foundation, 2023-12 |
URLB: Unsupervised Reinforcement Learning Benchmark Laskin, Michael; Denis Yarats; Hao Liu; Lee, Kimin; Albert Zhan; Kevin Lu; Catherine Cang; et al, 35th Conference on Neural Information Processing Systems (NeurIPS), Neural Information Processing Systems Foundation, 2021-12 |
Discover