KDD Cup 2024 Multi-task Online Shopping Challenges for LLM: 1st Place Winning Solution

45:11
LLM inference optimization: Model Quantization and Distillation

39:42
Mixture of Experts: Mixtral 8x7B

50:25
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs

44:06
LLM inference optimization: Architecture, KV cache and Flash attention

33:33
Build a PodCast Recommendation with LLM

26:52
Andrew Ng Explores The Rise Of AI Agents And Agentic Reasoning | BUILD 2024 Keynote

27:14
Transformers (how LLMs work) explained visually | DL5

27:41