Vision Language Models (VLMs)
PulseAugur coverage of Vision Language Models (VLMs): every cluster mentioning VLMs across labs, papers, and developer communities, ranked by signal.
3 days with sentiment data
-
New VLMs boost autonomous driving efficiency and spatial reasoning
Researchers are developing advanced Vision-Language Models (VLMs) for autonomous driving, focusing on improving efficiency and spatial reasoning. New methods like Fast-dDrive aim to balance high-fidelity trajectory plan…
-
New framework exposes vulnerabilities in visible-infrared vision-language models
Researchers have developed CFGPatch, a novel adversarial framework designed to expose vulnerabilities in visible-infrared vision-language models (VLMs). This method utilizes curved-edge fractal geometry and a modality-s…
-
New CRS framework boosts AI road understanding with structured supervision
Researchers have developed a new framework called the Combined Road Substrate (CRS) to improve visual reasoning for autonomous driving. CRS integrates geometric road structure with open-vocabulary semantics, allowing fo…