R*: Efficient Reward Design via Reward Structure Evolution and Parameter Alignment Optimization with Large Language Models, Pengyi Li, Jianye HAO, Hongyao Tang, Yifu Yuan, Jinbin Qiao, Zibin Dong, YAN ZHENG. The 42nd International Conference on Machine Learning (ICML): 2025