New methods KERV and HeiSD accelerate embodied VLA models with kinematic awareness

By PulseAugur Editorial · [2 sources] · 2026-04-28 04:00

Two new research papers introduce methods to accelerate the inference speed of Vision-Language-Action (VLA) models used for robot control. KERV utilizes a Kalman Filter to predict actions and adjust acceptance thresholds, achieving up to 37% acceleration with minimal success rate loss. HeiSD proposes a hybrid approach combining different speculative decoding techniques with kinematic awareness, reaching up to 2.45x speedup in simulations and 2.41x in real-world scenarios while maintaining high success rates. AI

IMPACT New techniques could significantly speed up robot control inference, enabling more responsive and efficient embodied AI systems.

RANK_REASON Two academic papers published on arXiv introduce novel methods for accelerating VLA models.

Read on arXiv cs.LG →

paper
other

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

New methods KERV and HeiSD accelerate embodied VLA models with kinematic awareness

COVERAGE [2]

arXiv cs.LG TIER_1 English(EN) · Zihao Zheng, Zhihao Mao, Maoliang Li, Jiayu Chen, Xinhao Sun, Zhaobo Zhang, Donggang Cao, Hong Mei, Xiang Chen · 2026-04-28 04:00

KERV: Kinematic-Rectified Speculative Decoding for Embodied VLA Models

arXiv:2603.01581v2 Announce Type: replace-cross Abstract: Vision-Language-Action (VLA) models build a token-domain robot control paradigm, yet suffer from low speed. Speculative Decoding (SD) is an optimization strategy that can boost inference speed. Two key issues emerge when i…
arXiv cs.LG TIER_1 English(EN) · Zihao Zheng, Zhihao Mao, Sicheng Tian, Maoliang Li, Jiayu Chen, Xinhao Sun, Zhaobo Zhang, Xuanzhe Liu, Donggang Cao, Hong Mei, Xiang Chen · 2026-04-28 04:00

HeiSD: Hybrid Speculative Decoding for Embodied Vision-Language-Action Models with Kinematic Awareness

arXiv:2603.17573v2 Announce Type: replace-cross Abstract: Vision-Language-Action (VLA) Models have become the mainstream solution for robot control, but suffer from slow inference speeds. Speculative Decoding (SD) is a promising acceleration method which can be divided into two c…

COVERAGE [2]

KERV: Kinematic-Rectified Speculative Decoding for Embodied VLA Models

HeiSD: Hybrid Speculative Decoding for Embodied Vision-Language-Action Models with Kinematic Awareness

RELATED ENTITIES

RELATED TOPICS