Reward Machines: Structuring Reward Function Specifications and Reducing Sample Complexity in RL
- Sheila Mcllraith | University of Toronto
Reinforcement Learning Day 2019:
Reward Machines: Structuring Reward Function Specifications and Reducing Sample Complexity in Reinforcement Learning
다음 볼만한 동영상
-
-
Designing Dynamic Measure Transport for Sampling
- Aimee Maurais
-
-
-
-
-
-
-
-
Upper Bound 2024: Towards Human-Centered AI in AAA Video Game
- Raluca Georgescu