arxiv:2501.19083

MotionPCM: Real-Time Motion Synthesis with Phased Consistency Model

Published on Jan 31

Authors:

Abstract

Diffusion models have become a popular choice for human motion synthesis due to their powerful generative capabilities. However, their high computational complexity and large number of sampling steps pose challenges for real-time applications. Fortunately, the Consistency Model (CM) offers a way to reduce the number of sampling steps from hundreds to a few, typically fewer than four, significantly accelerating sampling from diffusion models. However, applying CM to text-conditioned human motion synthesis in latent space yields unsatisfactory generation results. In this paper, we introduce MotionPCM, a phased consistency model-based approach designed to improve the quality and efficiency of real-time motion synthesis in latent space. Experimental results on the HumanML3D dataset show that our model achieves real-time inference at over 30 frames per second in a single sampling step while outperforming the previous state-of-the-art with a 38.9% improvement in FID. The code will be available for reproduction.
