R2R-CE
PulseAugur coverage of R2R-CE — every cluster mentioning R2R-CE across labs, papers, and developer communities, ranked by signal.
1 天有情绪数据
-
StereoNav framework boosts real-world navigation for AI agents
Researchers have introduced StereoNav, a new framework designed to improve the reliability of vision-and-language navigation (VLN) agents in real-world environments. The system addresses performance degradation caused b…
-
VLN-Cache通过动态令牌缓存提高视觉语言导航模型的速度
研究人员开发了VLN-Cache,一个旨在提高视觉和语言导航(VLN)模型效率的新框架。该方法通过重用稳定的视觉令牌,解决了实时应用中冗余计算的挑战。VLN-Cache 结合了视图对齐重映射来处理相机视角的改变,以及任务相关性过滤器来管理导航过程中语义焦点的转移。在 R2R-CE 基准测试上的实验表明,在保持导航成功率的同时,速度提升高达 1.52 倍。
-
Three-Step Nav planner improves zero-shot vision-language navigation agents
Researchers have developed a new hierarchical planner called Three-Step Nav to improve zero-shot vision-and-language navigation (VLN) agents. This method uses a three-view protocol to address common issues like drifting…