Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity
- Abhishek Gupta
- Aldo Pacchiano
- Yuexiang Zhai
- Sham Kakade
- Sergey Levine
Abstract to come…
Microsoft Research