Building and Curating Datasets for RLHF and LLM Fine-tuning // Daniel Vila Suero // LLMs in Prod Con
20:57
1984 All Over Again? An Open Ecosystem to Fight Closed Models // Filippo Pedrazzini // LLMs in Prod
30:28
Enabling Cost-Efficient LLM Serving with Ray Serve
59:15
Reinforcement Learning with Human Feedback (RLHF)
19:39
RLHF & DPO Explained (In Simple Terms!)
1:05:27
Fine-tuning Language Models for Structured Responses with QLoRa
49:11
LLMOps (LLM Bootcamp)
1:01:01
Mastering RLHF with AWS: A Hands-on Workshop on Reinforcement Learning from Human Feedback
18:54