PulseAugur

Researchers develop DeLock to prevent VLA policies from over-specializing post-training

Researchers have identified a phenomenon called "lock-in" in vision-language-action (VLA) policies, where extensive post-training on limited data causes the policy to become overly specialized and stop responding to new instructions. The issue manifests in two forms: concept lock-in, where the policy fixates on the objects seen during post-training, and spatial lock-in, where it fixates on the spatial targets seen during post-training. A new method called DeLock preserves the policy's pre-trained internal knowledge during post-training and applies contrastive prompt guidance at test time to steer the policy toward novel instructions, outperforming existing baselines across various evaluations.
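The paper itself is not quoted in detail here, but the test-time steering idea can be illustrated with a small sketch. Assuming contrastive prompt guidance works like classifier-free guidance, i.e. extrapolating from the prediction under the locked-in (training-like) prompt toward the prediction under the novel prompt, it might look as follows; `policy` is a hypothetical stand-in for a VLA model, and `scale` is an assumed guidance weight:

```python
import numpy as np

def policy(obs, prompt):
    # Toy stand-in for a VLA policy: returns a deterministic pseudo-action
    # vector for a given (observation, prompt) pair. A real policy would be
    # a learned model conditioned on images and language.
    rng = np.random.default_rng(abs(hash((obs, prompt))) % (2**32))
    return rng.normal(size=4)

def contrastive_guidance(obs, novel_prompt, locked_prompt, scale=2.0):
    """Steer the action toward the novel instruction and away from the
    locked-in post-training behavior by extrapolating between the two
    conditional predictions (classifier-free-guidance style)."""
    a_novel = policy(obs, novel_prompt)
    a_locked = policy(obs, locked_prompt)
    return a_locked + scale * (a_novel - a_locked)

action = contrastive_guidance(
    "tabletop_rgb", "pick up the red mug", "pick up the blue block"
)
print(action.shape)
```

With `scale=1.0` this reduces to the plain novel-prompt prediction; `scale > 1` pushes further away from the locked-in behavior. This is only an interpretation of "contrastive prompt guidance"; the paper's exact formulation may differ.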

Summary written by gemini-2.5-flash-lite from 1 source.

IMPACT Introduces a method to improve generalization in VLA policies post-training, potentially enabling more robust robotic control.

RANK_REASON Academic paper introducing a new method (DeLock) to address a specific problem (lock-in) in VLA policies.

Read on arXiv cs.CV →

COVERAGE [1]

  1. arXiv cs.CV TIER_1 · Suning Huang, Jiaqi Shao, Ke Wang, Qianzhong Chen, Jiankai Sun, Yanjiang Guo, Mac Schwager, Jeannette Bohg

    Breaking Lock-In: Preserving Steerability under Low-Data VLA Post-Training

    arXiv:2604.23121v1 · Announce Type: cross

    Abstract: Have you ever post-trained a generalist vision-language-action (VLA) policy on a small demonstration dataset, only to find that it stops responding to new instructions and is limited to behaviors observed during post-training? We …