Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback, Yifu Yuan, Jianye HAO, Yi Ma, Zibin Dong, Hebin Liang, Jinyi Liu, Zhixin Feng, Kai Zhao, YAN ZHENG. The 12th International Conference on Learning Representations (ICLR): 2024
点击次数:
上一条:Generate Subgoal Images before Act: Unlocking the Chain-of-Thought Reasoning in Diffusion Model for Robot Manipulation with Multimodal Prompts, Fei Ni, Jianye HAO, Shiguang Wu, LongxinKou, Jiashun Liu, YAN ZHENG, Bin Wang, Yuzheng Zhuang. Conference on Computer Vision and Pattern Recognition (CVPR): 2024
下一条:AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model, Zibin Dong*, Yifu Yuan*, Jianye HAO, Fei Ni, Yao Mu, YAN ZHENG, Yujing Hu, Tangjie Lv, Changjie Fan, Zhipeng Hu. The 12th International Conference on Learning Representations (ICLR): 2024