Knowledge Distillation: A Good Teacher is Patient and Consistent
14:30
The Scrum Guide (In under 15 minutes!)
28:54
Structured Outputs with DSPy
19:46
Quantization vs Pruning vs Distillation: Optimizing NNs for Inference
13:29
Knowledge Distillation with TAs
9:28
How ChatGPT Cheaps Out Over Time
26:10
Attention in transformers, visually explained | DL6
40:35
Self-training with Noisy Student improves ImageNet classification (Paper Explained)
20:07