Reward Machines: Structuring Reward Function Specifications and Reducing Sample Complexity in RL
- Sheila McIlraith | University of Toronto
Reinforcement Learning Day 2019:
Reward Machines: Structuring Reward Function Specifications and Reducing Sample Complexity in Reinforcement Learning
Watch next
Pushing boundaries of complex reasoning in small language models
- Maya Murad
- Mojan Javaheripi
ML for High-Performance Climate and Earth Virtualization Engines
- Torsten Hoefler