Stefano V. Albrecht - From Deep Reinforcement Learning to LLM-based Agents
55:14
Louis Kirsch - Towards Automating ML Research with general-purpose meta-learners @ UCL DARK
52:22
Kenneth O. Stanley - Novel Opportunities in Open-Endedness @ UCL DARK
56:53
Deep Reinforcement Learning for Multi-Agent Interaction - Stefano Albrecht
1:02:12
How to Build, Evaluate, and Iterate on LLM Agents
47:52
Maciej and Bartek - Fine-tuning Reinforcement Learning Models is a Forgetting Mitigation Problem
11:29
Reinforcement Learning from Human Feedback (RLHF) Explained
1:57:12
Building a GENERAL AI agent with reinforcement learning
1:04:49