MARRS: Masked Autoregressive Unit-based Reaction Synthesis
Researchers have developed MARRS, a novel framework for synthesizing human reactions conditioned on observed actions. The system utilizes a Unit-distinguished Motion Variational AutoEncoder (UD-VAE) to encode distinct body and hand units independently. It incorporates Action-Conditioned Fusion (ACF) to process reactive tokens and Mutual Unit Modulation (MUM) to enable interaction between body and hand units. A compact MLP serves as a noise predictor within a diffusion model for generating token probability distributions. AI
IMPACT Introduces a new method for generating coordinated human reaction motions, potentially improving embodied AI and animation.