Scalable MatMul-free Language Modeling (Paper Explained)
36:15
2024 12 24 15 43 12
57:45
Visualizing transformers and attention | Talk for TNG Big Tech Day '24
17:07
LoRA explained (and a bit about precision and quantization)
40:40
Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Paper Explained)
1:11:58
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools (Paper Explained)
57:00
xLSTM: Extended Long Short-Term Memory
59:38
JEPA - A Path Towards Autonomous Machine Intelligence (Paper Explained)
22:27