Deep dive: model merging (part 1)

32:15
Deep dive: model merging, part 2

2:11:12
How I use LLMs

57:45
Visualizing transformers and attention | Talk for TNG Big Tech Day '24

40:54
Deep dive - Better Attention layers for Transformer models

3:31:24
Deep Dive into LLMs like ChatGPT

1:09:00
[GRPO Explained] DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

33:49
Forja y supervivencia: sin tienda de campaña ni saco de dormir: cómo fabricar un horno de carbón,...

1:19:27