PulseAugur
EN
LIVE 10:07:36
ENTITY Vision-Language-Action (VLA) models

Vision-Language-Action (VLA) models

PulseAugur coverage of Vision-Language-Action (VLA) models — every cluster mentioning Vision-Language-Action (VLA) models across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
24
24 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
23
23 over 90d
TIER MIX · 90D
TOPICS
RELATIONSHIPS
SENTIMENT · 30D

9 day(s) with sentiment data

RECENT · PAGE 1/2 · 24 TOTAL
  1. RESEARCH · CL_109502 ·

    Robot manipulation models gain motion priors via two-stage training · 2 sources tracked

    Researchers have developed a novel two-stage training framework to improve Vision-Language-Action (VLA) models for robot manipulation. This approach first pre-trains an action module with motion priors using uncondition…

  2. TOOL · CL_109508 ·

    New FORCE framework boosts VLA model RL fine-tuning efficiency

    Researchers have developed FORCE, a novel three-stage framework designed to improve the efficiency and stability of Reinforcement Learning (RL) fine-tuning for Vision-Language-Action (VLA) models. This approach addresse…

  3. RESEARCH · CL_99608 ·

    New Tri-Info method predicts VLA model failures with high accuracy

    Researchers have developed a new method called Tri-Info to predict failures in Vision-Language-Action (VLA) models. This approach leverages information theory to analyze the signatures of successful and failed model rol…

  4. RESEARCH · CL_106805 ·

    New methods enhance VLA model efficiency and performance in robotics · 9 sources tracked

    Researchers are developing new methods to improve the efficiency and performance of Vision-Language-Action (VLA) models in robotics. One approach, Flow Policy Optimization (FPO), uses reinforcement learning to fine-tune…

  5. TOOL · CL_97636 ·

    New framework trains VLA models on unlabeled human videos

    Researchers have developed a new framework for training Vision-Language-Action (VLA) models using unlabeled human egocentric videos. The system employs a Hybrid Disentangled VQ-VAE to separate motion dynamics from backg…

  6. RESEARCH · CL_91038 ·

    New frameworks enhance AI embodied manipulation with reasoning and physics grounding · 4 sources tracked

    Researchers have developed Guava, a framework designed to enhance embodied manipulation capabilities in AI agents by integrating high-level reasoning with external modules for perception, planning, and control. This har…

  7. RESEARCH · CL_84410 ·

    New framework adapts VLA models for dexterous robot hands

    Researchers have developed InDex, a new framework designed to adapt Vision-Language-Action (VLA) models for dexterous robotic manipulation. This method addresses the challenge of applying general VLA models, typically t…

  8. RESEARCH · CL_79104 ·

    GEAR-VLA framework enhances robotic manipulation generalization

    Researchers have developed GEAR-VLA, a new framework designed to improve the generalizability of Vision-Language-Action (VLA) models in robotic manipulation tasks. This approach addresses limitations in current VLA mode…

  9. RESEARCH · CL_76937 ·

    ActionMap improves robot policy learning with voxel heatmap

    Researchers have developed ActionMap, a novel voxel heatmap action head designed to improve robot policy learning in vision-language-action (VLA) models. This new head replaces the traditional action decoder, predicting…

  10. TOOL · CL_78002 ·

    Hugging Face paper: Robots need better data interfaces, not just bigger models

    A new position paper from Hugging Face argues that advancing robot intelligence requires more than just scaling existing Vision-Language-Action (VLA) models. The paper highlights the need for specialized interfaces to p…

  11. RESEARCH · CL_70319 ·

    VISTA framework improves robot training with validated data

    Researchers have developed VISTA, a framework designed to improve the training of Vision-Language-Action (VLA) models using real-world robot data. The framework addresses challenges such as distorted camera views and ph…

  12. TOOL · CL_68309 ·

    New S2 framework boosts VLA model generalization with evidence budgets

    Researchers have developed a new framework called S2 (See Less, Specify More) to enhance the generalization capabilities of vision-language-action (VLA) models. S2 refines the executor's training by preserving high-leve…

  13. TOOL · CL_68285 ·

    New TRAP attack hijacks VLA models via adversarial patches

    Researchers have developed a novel attack method called TRAP that exploits the Chain-of-Thought (CoT) reasoning in Vision-Language-Action (VLA) models. This attack uses adversarial patches, such as a tablecloth, to mani…

  14. RESEARCH · CL_62767 ·

    New research probes VLM susceptibility to visual persuasion and influence

    Researchers are developing new frameworks to evaluate the susceptibility of Vision-Language Models (VLMs) to multimodal persuasion and visual influences. One study introduces MMPersuade to test agent-to-agent persuasion…

  15. RESEARCH · CL_62173 ·

    New framework detects robot execution failures using trajectory data

    Researchers have developed a new framework called Hide-and-Seek to improve the reliability of robots using Vision-Language-Action (VLA) models. This method detects execution failures by identifying specific actions that…

  16. TOOL · CL_51592 ·

    New X-Foresight model enhances VLA systems with predictive world modeling

    Researchers have developed X-Foresight, a new predictive world model integrated into Vision-Language-Action (VLA) models. This model aims to equip VLA systems with physical world knowledge by predicting future video seq…

  17. TOOL · CL_51142 ·

    VLA-Pruner enhances embodied AI efficiency by optimizing visual token pruning

    Researchers have developed VLA-Pruner, a new method to make Vision-Language-Action (VLA) models more efficient for embodied AI tasks. Existing visual token pruning techniques, designed for Vision-Language Models, degrad…

  18. RESEARCH · CL_50775 ·

    New Research Rethinks VLM Initialization for Action Models

    A new paper explores how to best initialize Vision-Language-Action (VLA) models by examining the impact of pretrained Vision-Language Model (VLM) representations. The research indicates that preserving the original VLM …

  19. TOOL · CL_44730 ·

    New RAW-Dream paradigm enables zero-shot VLA model adaptation

    Researchers have introduced RAW-Dream, a new paradigm for adapting Vision-Language-Action (VLA) models without task-specific data. This approach leverages a pre-trained, task-agnostic world model for predicting future t…

  20. RESEARCH · CL_42457 ·

    Driving AI models show reasoning fragility under sensor perturbations

    A new research paper titled "Lost in Fog" investigates the reasoning fragility of Vision-Language-Action (VLA) models in autonomous driving. The study subjected the Alpamayo R1 model to various sensor perturbations, inc…