Matryoshka Quantization
![](https://i.ytimg.com/vi/8wGrYNDsvuo/mqdefault.jpg)
21:43
DeepCrossAttention: Supercharging Transformer Residual Connections
![](https://i.ytimg.com/vi/zLFfjAjb6j0/mqdefault.jpg)
18:15
When One LLM Drools, Multi-LLM Collaboration Rules
![](https://i.ytimg.com/vi/fzfT-7tHM8E/mqdefault.jpg)
23:58
Self-Regulation and Requesting Interventions
![](https://i.ytimg.com/vi/nEGGiTGEVMU/mqdefault.jpg)
35:13
Demystifying Long Chain-of-Thought Reasoning in LLMs
![](https://i.ytimg.com/vi/-EHxcHMPm3M/mqdefault.jpg)
31:40
Scaling up Test-Time Compute with Latent Reasoning:A Recurrent Depth Approach
![](https://i.ytimg.com/vi/ncEtgu1cmms/mqdefault.jpg)
11:02
Stop being your own worst (financial) enemy
![](https://i.ytimg.com/vi/URtF_UHYBSo/mqdefault.jpg)
1:53:12
The Elegant Math Behind Machine Learning
![](https://i.ytimg.com/vi/M9kp8cvF6aI/mqdefault.jpg)
30:00