MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning, Yifu Yuan, Zhenrui Zheng, Zibin Dong, Jianye HAO. The 42nd International Conference on Machine Learning (ICML): 2025
点击次数:
上一条:DualRAG: A Dual-Process Approach to Integrate Reasoning and Retrieval for Multi-Hop Question Answering, Rong Cheng, Jinyi Liu, YAN ZHENG, Fei Ni, Jiazhen Du, Hangyu Mao, Fuzheng Zhang, Bo Wang, Jianye HAO. The 63rd Annual Meeting of the Association for Computational Linguistics (ACL): 2025
下一条:R*: Efficient Reward Design via Reward Structure Evolution and Parameter Alignment Optimization with Large Language Models, Pengyi Li, Jianye HAO, Hongyao Tang, Yifu Yuan, Jinbin Qiao, Zibin Dong, YAN ZHENG. The 42nd International Conference on Machine Learning (ICML): 2025