Scalable MatMul-free Language Modeling (Paper Explained)
40:40
Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Paper Explained)
57:00
xLSTM: Extended Long Short-Term Memory
33:04
Generative Model That Won 2024 Nobel Prize
59:38
JEPA - A Path Towards Autonomous Machine Intelligence (Paper Explained)
37:17
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
1:03:56
Privacy Backdoors: Stealing Data with Corrupted Pretrained Models (Paper Explained)
22:43
How might LLMs store facts | DL7
31:51