Lecture 25: Speaking Composable Kernel (CK)
33:13
AMD HACC Tech Talk: ROCm Ecosystem and HIP Programming
1:18:35
Lecture 26: SYCL Mode (Intel GPU)
58:12
Lecture 27: gpu.cpp - Portable GPU compute using WebGPU
24:37
Efficient LLM Inference with SGLang, Lianmin Zheng, xAI
53:20
10x Coffee Talk: Strategies for PCCP Implementation
1:11:27
Lecture 28: Liger Kernel - Efficient Triton Kernels for LLM Training
34:52
CUTLASS: A CUDA C++ Template Library for Accelerating Deep Learning... Aniket Shivam & Vijay Thakkar
44:59