AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model, Zibin Dong*, Yifu Yuan*, Jianye HAO, Fei Ni, Yao Mu, YAN ZHENG, Yujing Hu, Tangjie Lv, Changjie Fan, Zhipeng Hu. The 12th International Conference on Learning Representations (ICLR): 2024
点击次数:
上一条:Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback, Yifu Yuan, Jianye HAO, Yi Ma, Zibin Dong, Hebin Liang, Jinyi Liu, Zhixin Feng, Kai Zhao, YAN ZHENG. The 12th International Conference on Learning Representations (ICLR): 2024
下一条:MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL, Fei Ni, Jianye HAO, Yao Mu, Yifu Yuan, YAN ZHENG, Bin Wang, Zhixuan Liang. The 40th International Conference on Machine Learning (ICML): 2023