Lecture 13: Attention
1:12:04
Lecture 14: Visualizing and Understanding
27:07
Attention Is All You Need
1:13:27
Lecture 12: Recurrent Networks
36:16
The math behind Attention: Keys, Queries, and Values matrices
1:22:38
CS480/680 Lecture 19: Attention and Transformer Networks
17:38
The moment we stopped understanding AI [AlexNet]
1:56:20
Let's build GPT: from scratch, in code, spelled out.
1:11:16