Researchers have developed DEFLECT, a new post-training framework designed to improve the robustness of asynchronous Vision-Language-Action (VLA) policies in robotics. This method addresses the challenge of stale observations during inference by converting latency-induced mismatches into counterfactual preference supervision. DEFLECT trains policies to favor actions aligned with the execution-time state, without requiring human labels, online robot rollouts, or additional inference computation. Experiments across various tasks showed DEFLECT significantly enhances delay robustness, improving success rates by up to 6.4 percentage points. AI
影响 Enhances robotic control by improving VLA policy performance under latency, potentially enabling more complex real-world applications.
排序理由 This is a research paper detailing a new framework for improving AI model performance in a specific domain. [lever_c_demoted from research: ic=1 ai=1.0]
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →