博弈智能的研究与应用, 郝建业, 邵坤, 李凯, 李栋, 毛航宇, 胡舒悦, 王震. 中国科学·信息科学, Volume 53, Issue 10: 1892 (2023): 2023
点击次数:
上一条:Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning, Chenjia Bai, Lingxiao Wang, Jianye Hao, Zhuoran Yang, Bin Zhao, Zhen Wang, Xuelong Li. Artificial Intelligence 326 (2024) 104048 (AIJ): 2023
下一条:Accelerating deep reinforcement learning via knowledge-guided policy network, Yuanqiang Yu, Peng Zhang, Kai Zhao, Yan Zheng, Jianye Hao. Journal of Autonomous Agents and Multi-Agent Systems 37(1): 17 (2023) (JAAMAS): 2023