论文索引
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning
- Runze Liu, Fengshuo Bai, Yali Du, Yaodong Yang
- NeurIPS
- 2022
下载
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
- Bo Liu, Xidong Feng, Jie Ren, Luo Mai, Rui Zhu, Haifeng Zhang, Jun Wang, Yaodong Yang
- NeurIPS
- 2022
下载
A Unified Diversity Measure for Multiagent Reinforcement Learning
- Zongkai Liu, Chao Yu, Yaodong Yang, Peng Sun, Zifan Wu, Yuan Li
- NeurIPS
- 2022
下载
Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation
- Zhizhou Ren, Anji Liu, Yitao Liang, Jian Peng, Jianzhu Ma
- NeurIPS
- 2022
下载
Constrained Update Projection Approach to Safe Policy Optimization
- Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan
- NeurIPS
- 2022
下载
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
- Yuanpei Chen, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuang Jiang, Stephen Marcus McAleer, Yiran Geng, Hao Dong, Zongqing Lu, Song-Chun Zhu, Yaodong Yang
- NeurIPS Datasets and Benchmarks
- 2022
下载