Hardware-aware Algorithms for Sequence Modeling - Tri Dao | Stanford MLSys #87
1:16:48
Notes on AI Hardware - Benjamin Spector | Stanford MLSys #88
58:58
FlashAttention - Tri Dao | Stanford MLSys #67
57:19
Efficiently Modeling Long Sequences with Structured State Spaces - Albert Gu | Stanford MLSys #46
59:17
Serving 100s of LLMs on 1 GPU with LoRAX - Travis Addair | Stanford MLSys #84
1:19:27
Stanford CS25: V3 I Retrieval Augmented Language Models
56:32
Monarch Mixer: Making Foundation Models More Efficient - Dan Fu | Stanford MLSys #86
1:44:31
Stanford CS229 I Machine Learning I Building Large Language Models (LLMs)
1:26:45