ENTITY
multimodal large language model
multimodal large language model
PulseAugur coverage of multimodal large language model — every cluster mentioning multimodal large language model across labs, papers, and developer communities, ranked by signal.
Total · 30d
25
25 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
25
25 over 90d
TIER MIX · 90D
SENTIMENT · 30D
5 day(s) with sentiment data
RECENT · PAGE 2/2 · 22 TOTAL
-
New frameworks enhance VLM spatial reasoning with world models and multi-agent systems
Researchers have developed World2VLM, a novel training framework that distills spatial reasoning capabilities from generative world models into vision-language models (VLMs). This approach synthesizes future views to pr…
-
ByteDance unveils Astra, a dual-model AI for advanced robot navigation
ByteDance has introduced Astra, a novel dual-model architecture designed to enhance autonomous robot navigation in complex indoor environments. The system employs a System 1/System 2 approach, with Astra-Global handling…