Multimodal Reasoning, Video Instruction-Tuning & Explaining Vision Backbones | Multimodal Weekly 53
1:06:52
How-to Videos, Feeling Multimodal Intelligence, & Visually-Grounded Video QA | Multimodal Weekly 52
41:22
Visual Insights from Social Data with Phyllo and Twelve Labs | Multimodal Weekly 54
1:08
Marengo 2.7: Video Search at Your Fingertips
1:03:27
Time-Interval Machine, ID-Aware Movie Descriptions, and Story Summarization | Multimodal Weekly 56
1:15:17
Temporal Action Localization, Hallucination Benchmark, and Attention for ViTs | Multimodal Weekly 62
37:05
Super.com Co-Founder's Playbook for Scaling Super App to $1B+/year in Sales - Henry Shi
58:37
Multimodal Data Lake, Video Repetition Counting, and Low-Resource Vision | Multimodal Weekly 51
58:02