Researchers have developed ViewSAM, a novel framework for weakly supervised Cross-view Referring Multi-Object Tracking (CRMOT). This approach leverages foundation models like SAM2 and SAM3 to generate pseudo-supervision, reducing the need for costly frame-level annotations. ViewSAM explicitly models view-aware cross-modal semantics, enabling robust tracking across different camera perspectives with minimal additional parameters. AI
影响 Introduces a more efficient method for multi-object tracking across camera views by reducing reliance on extensive annotations.
排序理由 The cluster contains a research paper detailing a new model and framework for a specific computer vision task.
AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →