ENTITY Qwen2.5-VL-72B

PulseAugur coverage of Qwen2.5-VL-72B: every cluster that mentions the model across labs, papers, and developer communities, ranked by signal.

Total · 30d: 4 (4 over 90d)

Releases · 30d: 0 (0 over 90d)

Papers · 30d: 4 (4 over 90d)

SENTIMENT · 30D: 2 days with sentiment data

RECENT · PAGE 1/1 · 4 TOTAL

RESEARCH · CL_97982 · Jun 17 · 00:00

OmniAgent uses active perception for efficient video understanding · 2 sources tracked

Researchers have introduced OmniAgent, a novel omni-modal agent designed for video understanding that utilizes an iterative Observation-Thought-Action cycle based on Partially Observable Markov Decision Processes (POMDP…
RESEARCH · CL_79677 · Jun 8 · 12:09

New CapRL++ framework trains better image and video captioning models

Researchers have developed CapRL++, a novel framework for training image and video captioning models using reinforcement learning with verifiable rewards. This approach moves beyond traditional supervised fine-tuning by…
TOOL · CL_38915 · May 19 · 08:58

CodePercept boosts LLM visual perception using code, not just reasoning

Researchers from Shanghai Jiao Tong University and the Qwen team have introduced CodePercept, a novel approach to enhance large language models' visual perception capabilities, particularly for STEM tasks. Their researc…
RESEARCH · CL_20276 · May 6 · 17:32

WALDO framework improves VLM-based medical imaging anomaly detection

Researchers have developed WALDO, a novel framework for anomaly localization in medical imaging using vision-language models (VLMs). This method reformulates the problem as a comparative inference task, identifying anom…