Unimotion: Unifying 3D Human Motion Synthesis and Understanding

Li, Chuqiao; Chibane, Julian; He, Yannan; Pearl, Naama; Geiger, Andreas; Pons-moll, Gerard

Computer Science > Computer Vision and Pattern Recognition

arXiv:2409.15904 (cs)

[Submitted on 24 Sep 2024 (v1), last revised 30 Sep 2024 (this version, v2)]

Title:Unimotion: Unifying 3D Human Motion Synthesis and Understanding

Authors:Chuqiao Li, Julian Chibane, Yannan He, Naama Pearl, Andreas Geiger, Gerard Pons-moll

View PDF HTML (experimental)

Abstract:We introduce Unimotion, the first unified multi-task human motion model capable of both flexible motion control and frame-level motion understanding. While existing works control avatar motion with global text conditioning, or with fine-grained per frame scripts, none can do both at once. In addition, none of the existing works can output frame-level text paired with the generated poses. In contrast, Unimotion allows to control motion with global text, or local frame-level text, or both at once, providing more flexible control for users. Importantly, Unimotion is the first model which by design outputs local text paired with the generated poses, allowing users to know what motion happens and when, which is necessary for a wide range of applications. We show Unimotion opens up new applications: 1.) Hierarchical control, allowing users to specify motion at different levels of detail, 2.) Obtaining motion text descriptions for existing MoCap data or YouTube videos 3.) Allowing for editability, generating motion from text, and editing the motion via text edits. Moreover, Unimotion attains state-of-the-art results for the frame-level text-to-motion task on the established HumanML3D dataset. The pre-trained model and code are available available on our project page at this https URL.

Comments:	Project Page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2409.15904 [cs.CV]
	(or arXiv:2409.15904v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2409.15904

Submission history

From: Chuqiao Li [view email]
[v1] Tue, 24 Sep 2024 09:20:06 UTC (27,254 KB)
[v2] Mon, 30 Sep 2024 10:39:38 UTC (54,503 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Unimotion: Unifying 3D Human Motion Synthesis and Understanding

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Unimotion: Unifying 3D Human Motion Synthesis and Understanding

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators