12:46
Speculative Decoding: When Two LLMs are Faster than One
22:14
How to Measure LLM Confidence: Logprobs & Structured Output
12:43
Speech LLMs: Models that listen and talk back
28:18
The Code That Revolutionized Orbital Simulation
11:17