GPUs in Kubernetes for AI Workloads
![](https://i.ytimg.com/vi/aM2Y9m2Kazk/mqdefault.jpg)
21:08
What The Heck Are Kubernetes Resources, CRs, CRDs, Operators, etc.?
![](https://i.ytimg.com/vi/-1H0BeN9hIk/mqdefault.jpg)
28:24
Mastering Kubernetes: Service and Network APIs (Service, Ingress, GatewayAPI)
![](https://i.ytimg.com/vi/fm1T3In3Mdc/mqdefault.jpg)
13:00
Using Clusters to Boost LLMs 🚀
![](https://i.ytimg.com/vi/Yomo2DnL9NA/mqdefault.jpg)
20:19
Ollama with GPU on Kubernetes: 70 Tokens/sec !
![](https://i.ytimg.com/vi/HQY2jgSN6pA/mqdefault.jpg)
26:42
Scaling Explained Through Kubernetes HPA, VPA, KEDA & Cluster Autoscaler
![](https://i.ytimg.com/vi/oY9le4DDAOY/mqdefault.jpg)
15:16
Unleashing WebAssembly in Kubernetes with Kwasm
![](https://i.ytimg.com/vi/rfu5FwncZ6s/mqdefault.jpg)
56:20
Building a GPU cluster for AI
![](https://i.ytimg.com/vi/U6weXlzQxoY/mqdefault.jpg)
31:25