Previous: What about Inputting Policy in Value Function: Policy Representation and Policy-Extended Value Function Approximator, Hongyao Tang, Zhaopeng Meng, Jianye Hao, Chen Chen, Daniel Graves, Dong Li, Changmin Yu, Hangyu Mao, Wulong Liu, Yaodong Yang, Wenyuan Tao, Li Wang. The 36th AAAI Conference on Artificial Intelligence (AAAI): 2022
Next: Identifying potential risk genes for clear cell renal cell carcinoma with deep reinforcement learning, Dazhi Lu, Yan Zheng, Xianyanling Yi, Jianye Hao, Xi Zeng, Lu Han, Zhigang Li, Shaoqing Jiao, Bei Jiang, Jianzhong Ai, Jiajie Peng. Nature Communications: 2025/4/15