LongVideoBench
PulseAugur coverage of LongVideoBench — every cluster mentioning LongVideoBench across labs, papers, and developer communities, ranked by signal.
-
Kwai releases Keye-VL-2.0 for long-video understanding
Kwai has released Keye-VL-2.0-30B-A3B, an open-source multimodal foundation model designed for long-video understanding and agentic intelligence. This model utilizes DeepSeek Sparse Attention to process up to 256K conte…
-
CREST method efficiently selects key frames from long videos
Researchers have developed CREST, a novel method for efficiently selecting key frames from long videos. This training-free approach leverages the temporal geometry of query-frame relevance, specifically focusing on loca…
-
GridProbe cuts VLM compute cost for long videos
Researchers have developed GridProbe, a novel method to improve the efficiency of long-video Vision-Language Models (VLMs). This technique adaptively selects relevant frames during inference, reducing the computational …
-
LinMU achieves linear complexity for multimodal understanding models
Researchers have developed LinMU, a novel Vision-Language Model (VLM) architecture that achieves linear complexity, overcoming the quadratic complexity limitations of current models. This new design utilizes an M-MATE b…