Video-MME
PulseAugur coverage of Video-MME — every cluster mentioning Video-MME across labs, papers, and developer communities, ranked by signal.
1 day(s) with sentiment data
-
ReTool-Video enhances video agents with recursive tool use
Researchers have introduced ReTool-Video, a novel approach for video understanding agents that enhances their reasoning capabilities. This method utilizes an expanded tool library with 134 specialized tools, including m…
-
LinMU achieves linear complexity for multimodal understanding models
Researchers have developed LinMU, a novel Vision-Language Model (VLM) architecture that achieves linear complexity, overcoming the quadratic complexity limitations of current models. This new design utilizes an M-MATE b…
-
Introducing gpt-realtime and Realtime API updates
OpenAI has released GPT-4.1, a new series of models for its API that offer significant improvements in coding, instruction following, and long context comprehension, outperforming previous models like GPT-4o. The compan…