ENTITY Video-MLLMs

Video-MLLMs

PulseAugur coverage of Video-MLLMs — every cluster mentioning Video-MLLMs across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

5 over 90d

Releases · 30d

0 over 90d

Papers · 30d

5 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

4 day(s) with sentiment data

RECENT · PAGE 1/1 · 5 TOTAL

TOOL · CL_117639 · Jun 30 · 04:00

MotionAtlas system offers detailed region captioning for videos

Researchers have introduced MotionAtlas, a novel system designed for detailed captioning of motion-centric videos. This system includes a new benchmark dataset with 2,073 multiple-choice questions, a scalable pipeline f…
RESEARCH · CL_107906 · Jun 23 · 15:50

New SER method enhances Video MLLM reasoning with semantic evidence rewards · 4 sources tracked

Researchers have developed a new method called Semantic Evidence Reward (SER) to improve the spatio-temporal reasoning capabilities of Video Multimodal Large Language Models (Video MLLMs). Existing models often struggle…
RESEARCH · CL_99809 · Jun 18 · 08:28

New CARE framework optimizes reasoning length in video-MLLMs

Researchers have introduced CARE, a novel framework designed to optimize reasoning length in multimodal video models. This competence-aware reward shaping approach adapts the model's training by shifting its preference …
TOOL · CL_97681 · Jun 16 · 19:42

New CF-GRPO framework enhances video reasoning in multimodal LLMs

Researchers have introduced Consensus Frame GRPO (CF-GRPO), a novel reward framework designed to enhance the reasoning capabilities of video multimodal large language models (Video-MLLMs). This framework operates withou…
RESEARCH · CL_08222 · Apr 28 · 03:45

FCMBench-Video benchmark evaluates document understanding in videos for AI models

Researchers have introduced FCMBench-Video, a new benchmark designed to evaluate the capabilities of Video-Multimodal Large Language Models (Video-MLLMs) in understanding documents presented in video format. This benchm…

MotionAtlas system offers detailed region captioning for videos

New SER method enhances Video MLLM reasoning with semantic evidence rewards · 4 sources tracked

New CARE framework optimizes reasoning length in video-MLLMs

New CF-GRPO framework enhances video reasoning in multimodal LLMs

FCMBench-Video benchmark evaluates document understanding in videos for AI models