How does GRPO work?
Reinforcement Learning for LLMs in 2025
Which Is the Best LLM? Google vs OpenAI, Anthropic, and DeepSeek
Group Relative Policy Optimization (GRPO) - Formula and Code
Advanced Embedding Models and Techniques for RAG
GRPO: How DeepSeek R1's Reinforcement Learning Works
Deep Dive into LLMs like ChatGPT
How Deepseek v3 made Compute and Export Controls Less Relevant