Time-Interval Machine, ID-Aware Movie Descriptions, and Story Summarization | Multimodal Weekly 56
1:03:44
Long-Form Video Reasoning and Question-Answering | Multimodal Weekly 55
1:15:17
Temporal Action Localization, Hallucination Benchmark, and Attention for ViTs | Multimodal Weekly 62
1:03:52
Composed Video Retrieval, Consent In Crisis, and Video Annotations at Scale | Multimodal Weekly 57
54:46
Generalized Contrastive Learning and Transforming Video Production | Multimodal Weekly 50
59:44
A Deep Dive into Twelve Labs Embed API for Multimodal Embeddings | Multimodal Weekly 66
36:17
Analysis and Insights from Holistic Evaluation on Video Foundation Models | Multimodal Weekly 65
59:55
Unified Video Segmentation and Video Object Segmentation | Multimodal Weekly 59
57:37