Q-learning with Flow-Matching Policies
- Qiyang (Colin) Li | UC Berkeley
Expressive policies such as diffusion and flow-matching policies have recently driven progress in robotic manipulation because they can model complex action distributions and generalize from just a handful of demonstrations. But most are still trained purely with supervised imitation learning. Optimizing them with off-policy reinforcement learning remains challenging, which limits real-world applicability for tasks that require online self-improvement and adaptations. In this talk, I will discuss approaches for making off-policy RL work with flow-matching policies.
Speaker bio
Qiyang (Colin) Li is a PhD student at UC Berkeley advised by Prof. Sergey Levine. His research interests include reinforcement learning and robot learning, with a focus on leveraging offline prior experience for online exploration. Before that, he was an undergraduate student at the University of Toronto advised by Prof. Roger Grosse.
系列: MSR New England Generative Modeling & Sampling Seminar
-
-
Physics and information theory of generative diffusion
- Luca Ambrogioni
-
-
Matching features, not tokens: Energy-based fine-tuning of language models
- Mujin Kwun,
- Carles Domingo-Enrich
-
-
-
Generative Models for Molecular Dynamics Across Timescales
- Michael Plainer,
- Winfried Ripken,
- Gregor Lied
-
-
Q-learning with Flow-Matching Policies
- Qiyang (Colin) Li
-
-
-
A non-Markovian approach to diffusion-based sampling
- Lorenz Richter
-
Blind denoising diffusion models and the blessings of dimensionality
- Aram-Alexandre Pooladian
-
Meta Flow Maps
- Peter Potaptchik