PulseAugur
EN
LIVE 07:03:38
ENTITY VSI-Bench

VSI-Bench

PulseAugur coverage of VSI-Bench — every cluster mentioning VSI-Bench across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
8
8 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
8
8 over 90d
TIER MIX · 90D
TOPICS
RELATIONSHIPS
SENTIMENT · 30D

4 day(s) with sentiment data

RECENT · PAGE 1/1 · 8 TOTAL
  1. RESEARCH · CL_105024 ·

    New framework DR-MV3D enhances 3D visual question answering with dense rewards

    Researchers have introduced DR-MV3D, a novel framework designed to enhance multi-view 3D visual question answering (MV3D-VQA). This approach utilizes dense, verifiable rewards to supervise the reasoning process, moving …

  2. RESEARCH · CL_97820 ·

    OneCanvas simplifies 3D scene understanding for VLMs with panoramic reprojection

    Researchers have developed OneCanvas, a novel approach to 3D scene understanding for vision-language models (VLMs). Instead of complex geometry encoders or extensive training, OneCanvas projects patch features onto a si…

  3. TOOL · CL_79746 ·

    New framework AlloSpatial boosts foundation model spatial reasoning

    Researchers have introduced AlloSpatial, a new framework designed to enhance the spatial reasoning capabilities of foundation models. This framework converts egocentric observations into structured allocentric represent…

  4. TOOL · CL_92090 ·

    New AlloSpatial Framework Boosts AI Spatial Reasoning

    Researchers have developed AlloSpatial, a new framework designed to improve the spatial reasoning capabilities of foundation models. This framework addresses the limitation of current models by converting egocentric obs…

  5. RESEARCH · CL_44057 ·

    Cambrian-P video model uses camera pose for improved spatial reasoning

    Researchers have introduced Cambrian-P, a novel video multimodal large language model (MLLM) that incorporates camera pose information. This approach treats video frames not as isolated images but as part of a continuou…

  6. RESEARCH · CL_14362 ·

    GeoThinker framework actively integrates geometry for advanced spatial reasoning

    Researchers have developed GeoThinker, a novel framework that enhances spatial reasoning in multimodal large language models (MLLMs) by actively integrating geometric information. Unlike previous passive fusion methods,…

  7. RESEARCH · CL_06186 ·

    VLMs tackle visual illusions, spatial reasoning, and evaluation benchmarks

    Researchers are developing new methods to improve the robustness and reasoning capabilities of Vision-Language Models (VLMs). One approach, Structured Qualitative Inference (SQI), aims to mitigate visual illusions by en…

  8. RESEARCH · CL_02944 ·

    New frameworks enhance VLM spatial reasoning with world models and multi-agent systems

    Researchers have developed World2VLM, a novel training framework that distills spatial reasoning capabilities from generative world models into vision-language models (VLMs). This approach synthesizes future views to pr…