LambdaNetworks: Modeling long-range Interactions without Attention (Paper Explained)
29:56
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Paper Explained)
55:46
OpenAI DALL·E: Creating Images from Text (Blog Post Explained)
24:52
The Most Useful Thing AI Has Done
49:45
Scalable MatMul-free Language Modeling (Paper Explained)
57:11
Introduction to long range interactions: a theoretical physicist's view - Lecture 1
28:48
LSTM is dead. Long Live Transformers!
37:01
TransformerFAM: Feedback attention is working memory
30:54