The Mamba in the Llama: Distilling and Accelerating Hybrid Models
25:45
Long-Context LLM Extension
40:25
How China’s New AI Model DeepSeek Is Threatening U.S. Dominance
25:13
Street Fighting Transformers
40:40
Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Paper Explained)
33:50
Do we need Attention? A Mamba Primer
57:25
Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - 693
43:55
How can AI unlock solutions to our biggest challenges?
36:45