The Bitterest of Lessons: The Role of Data and Optimization in Emergence
51:03
Reinforcement Learning Pretraining for Reinforcement Learning Finetuning
38:23
Making Real-World Reinforcement Learning Practical
48:32
Sparks of AGI: early experiments with GPT-4
1:16:10
L1 MDPs, Exact Solution Methods, Max-ent RL (Foundations of Deep RL Series)
10:27
OKAMI: Teaching Humanoid Robots Manipulation Skills through Single Video Imitation
42:20
Ensuring Safety in Online Reinforcement Learning by Leveraging Offline Data
1:33:35
Sergey Levine on the bottlenecks to generalization in RL and picking good research problems
29:19