George Hotz | Programming | Decision Transformer Reinforcement Learning (RL) | LunarLander | Part 1
8:14:18
George Hotz | Exploring | finding exploits in AMD's GPU firmware | Giving up on AMD for the tinybox
27:14
Transformers (how LLMs work) explained visually | DL5
4:56:25
George Hotz | Programming | RL is dumb and doesn't work | Reinforcement Learning LunarLander Part 2
4:17:10
George Hotz | Programming | what is the Q* algorithm? OpenAI Q Star Algorithm | Mistral 7B | PRM800K
57:27
[Part 1] Study with Me - Learning C and Build your own Lisp
19:32
Reinforcement Learning - My Algorithm vs State of the Art
5:13:54
AI'a 2 proje kodlattık! #cursor (canlı yayın)
3:57:35