MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Researchers have developed MotionGPT-2, a large motion-language model designed to generate and understand human movements from text descriptions. The model integrates multimodal inputs such as text and poses into a unified prompt system, enabling it to handle a variety of motion-related tasks. MotionGPT-2 uses a novel motion discretization framework to provide fine-grained control over body and hand movements, and it demonstrates effectiveness in generation, captioning, and completion tasks.
AI IMPACT: This model advances the state of the art in generating realistic human motion from text, with potential applications in animation, gaming, and virtual reality.