ENTITY VSI-Bench

PulseAugur coverage of VSI-Bench — every cluster mentioning VSI-Bench across labs, papers, and developer communities, ranked by signal.

Total · 30d: 8 (8 over 90d)

Releases · 30d: 0 (0 over 90d)

Papers · 30d: 8 (8 over 90d)

RELATIONSHIPS

used by MindCube 70%

SENTIMENT · 30D

4 days with sentiment data

RECENT · 8 TOTAL

RESEARCH · CL_105024 · Jun 22 · 00:00

New framework DR-MV3D enhances 3D visual question answering with dense rewards

Researchers have introduced DR-MV3D, a novel framework designed to enhance multi-view 3D visual question answering (MV3D-VQA). This approach utilizes dense, verifiable rewards to supervise the reasoning process, moving …

RESEARCH · CL_97820 · Jun 17 · 16:29

OneCanvas simplifies 3D scene understanding for VLMs with panoramic reprojection

Researchers have developed OneCanvas, a novel approach to 3D scene understanding for vision-language models (VLMs). Instead of complex geometry encoders or extensive training, OneCanvas projects patch features onto a si…

TOOL · CL_79746 · Jun 9 · 04:00

New framework AlloSpatial boosts foundation model spatial reasoning

Researchers have introduced AlloSpatial, a new framework designed to enhance the spatial reasoning capabilities of foundation models. This framework converts egocentric observations into structured allocentric represent…

TOOL · CL_92090 · Jun 8 · 00:00

New AlloSpatial Framework Boosts AI Spatial Reasoning

Researchers have developed AlloSpatial, a new framework designed to improve the spatial reasoning capabilities of foundation models. This framework addresses the limitation of current models by converting egocentric obs…

RESEARCH · CL_44057 · May 21 · 17:59

Cambrian-P video model uses camera pose for improved spatial reasoning

Researchers have introduced Cambrian-P, a novel video multimodal large language model (MLLM) that incorporates camera pose information. This approach treats video frames not as isolated images but as part of a continuou…

RESEARCH · CL_14362 · May 4 · 04:00

GeoThinker framework actively integrates geometry for advanced spatial reasoning

Researchers have developed GeoThinker, a novel framework that enhances spatial reasoning in multimodal large language models (MLLMs) by actively integrating geometric information. Unlike previous passive fusion methods,…

RESEARCH · CL_06186 · Apr 27 · 10:45

VLMs tackle visual illusions, spatial reasoning, and evaluation benchmarks

Researchers are developing new methods to improve the robustness and reasoning capabilities of Vision-Language Models (VLMs). One approach, Structured Qualitative Inference (SQI), aims to mitigate visual illusions by en…

RESEARCH · CL_02944 · Apr 23 · 01:19

New frameworks enhance VLM spatial reasoning with world models and multi-agent systems

Researchers have developed World2VLM, a novel training framework that distills spatial reasoning capabilities from generative world models into vision-language models (VLMs). This approach synthesizes future views to pr…