PulseAugur
实时 09:21:39

DEFLECT framework boosts robotic VLA policy delay robustness

Researchers have developed DEFLECT, a new post-training framework designed to improve the robustness of asynchronous Vision-Language-Action (VLA) policies in robotics. This method addresses the challenge of stale observations during inference by converting latency-induced mismatches into counterfactual preference supervision. DEFLECT trains policies to favor actions aligned with the execution-time state, without requiring human labels, online robot rollouts, or additional inference computation. Experiments across various tasks showed DEFLECT significantly enhances delay robustness, improving success rates by up to 6.4 percentage points. AI

影响 Enhances robotic control by improving VLA policy performance under latency, potentially enabling more complex real-world applications.

排序理由 This is a research paper detailing a new framework for improving AI model performance in a specific domain. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

报道来源 [1]

  1. arXiv cs.AI TIER_1 English(EN) · Yixiang Zhu, Yonghao Chen, Zijie Yang, Yusong Hu, Xinyu Chen ·

    DEFLECT: Temporal Counterfactual Preference Learning for Delay-Robust Asynchronous VLAs

    arXiv:2605.19294v2 Announce Type: replace-cross Abstract: Vision-Language-Action (VLA) policies increasingly rely on asynchronous inference to hide large-model latency behind ongoing robot motion. While this avoids the stop-and-go behavior of synchronous action-chunk execution, i…