Long-Context LLM Extension

11:31
What is a Context Window? Unlocking LLM Secrets

49:33
How DeepSeek Changes the LLM Story

13:39
How Rotary Position Embedding Supercharges Modern LLMs

47:56
Speculations on Test-Time Scaling (o1)

20:07
The Mamba in the Llama: Distilling and Accelerating Hybrid Models

33:50
Do we need Attention? A Mamba Primer

46:22
It's Not About Scale, It's About Abstraction

29:38