Notes on AI Hardware - Benjamin Spector | Stanford MLSys #88
57:05
Text2SQL: The Dream versus Reality - Laurel Orr | Stanford MLSys #89
1:22
Flash Attention in less than 5 lines of code 🧑💻
37:44
Comment les cybercriminels exploitent nos failles, décryptage avec Orange Cyberdéfense et Proofpoint
59:17
Serving 100s of LLMs on 1 GPU with LoRAX - Travis Addair | Stanford MLSys #84
33:29
AI Hardware w/ Jim Keller
1:12:05
Stanford ECON295/CS323 I 2024 I The AI Awakening, Erik Brynjolfsson
44:06
LLM inference optimization: Architecture, KV cache and Flash attention
1:06:34