Reward Machines: Structuring Reward Function Specifications and Reducing Sample Complexity in RL

October 3, 2019
Sheila McIlraith | University of Toronto

Reinforcement Learning Day 2019: Reward Machines: Structuring Reward Function Specifications and Reducing Sample Complexity in Reinforcement Learning

Research area
- Artificial intelligence
Research lab
- Microsoft Research Lab - New York City
Group
- Reinforcement Learning
Event
- Reinforcement Learning Day 2019

