Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution – Paper Explained
6:37
REPA Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You ...
20:18
Why Does Diffusion Work Better than Auto-Regression?
22:27
MAMBA and State Space Models explained | SSM explained
1:20:35
Discrete diffusion modeling by estimating the ratios of the data distribution
11:05
Mission: Impossible language models – Paper Explained [ACL 2024 recording]
8:55
Direct Preference Optimization: Your Language Model is Secretly a Reward Model | DPO paper explained
19:48
Transformers explained | The architecture behind LLMs
17:30