Outperforming cuBLAS on H100
