NFNets: High-Performance Large-Scale Image Recognition Without Normalization (ML Paper Explained)
54:59
Dreamer v2: Mastering Atari with Discrete World Models (Machine Learning Research Paper Explained)
59:33
LambdaNetworks: Modeling long-range Interactions without Attention (Paper Explained)
16:41
NFNet and NFResNet: High-Performance Large-Scale Image Recognition Without Normalization
31:51
MAMBA from Scratch: Neural Nets Better and Faster than Transformers
1:11:58
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools (Paper Explained)
33:26
ORPO: Monolithic Preference Optimization without Reference Model (Paper Explained)
20:18
Why Does Diffusion Work Better than Auto-Regression?
43:04