MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model

Niu, Muyao; Cun, Xiaodong; Wang, Xintao; Zhang, Yong; Shan, Ying; Zheng, Yinqiang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2405.20222 (cs)

[Submitted on 30 May 2024 (v1), last revised 11 Jul 2024 (this version, v3)]


Abstract: We present MOFA-Video, an advanced controllable image animation method that generates video from a given image using various additional control signals (such as human landmark references, manual trajectories, or even another provided video) or their combinations. This differs from previous methods, which either work only on a specific motion domain or show weak control abilities with a diffusion prior. To achieve our goal, we design several domain-aware motion field adapters (i.e., MOFA-Adapters) to control the generated motions in the video generation pipeline. For MOFA-Adapters, we consider the temporal motion consistency of the video and first generate dense motion flow from the given sparse control conditions; the multi-scale features of the given image are then warped as a guided feature for stable video diffusion generation. We naively train two motion adapters, one for manual trajectories and one for human landmarks, since both contain sparse information about the control. After training, the MOFA-Adapters in different domains can also work together for more controllable video generation. Project Page: this https URL
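
The feature-warping step mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names (`warp_features`, `downscale_flow`, `warp_pyramid`), the backward-warping convention, the bilinear sampling, and the average-pooling used to resize the flow are all assumptions chosen for illustration. The learned sparse-to-dense flow generation is not shown; the dense flow is simply taken as given.

```python
import numpy as np


def warp_features(feat, flow):
    """Backward-warp a (C, H, W) feature map with a dense (2, H, W) flow.

    Assumed convention: flow[0] is the x displacement and flow[1] the y
    displacement in pixels, so output[:, y, x] samples feat at (y + dy, x + dx).
    """
    _, h, w = feat.shape
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    # Sampling positions, clamped to the image border.
    sx = np.clip(xs + flow[0], 0, w - 1)
    sy = np.clip(ys + flow[1], 0, h - 1)
    x0 = np.floor(sx).astype(int)
    y0 = np.floor(sy).astype(int)
    x1 = np.minimum(x0 + 1, w - 1)
    y1 = np.minimum(y0 + 1, h - 1)
    wx = sx - x0
    wy = sy - y0
    # Bilinear interpolation of the four neighbouring feature vectors.
    top = feat[:, y0, x0] * (1 - wx) + feat[:, y0, x1] * wx
    bottom = feat[:, y1, x0] * (1 - wx) + feat[:, y1, x1] * wx
    return top * (1 - wy) + bottom * wy


def downscale_flow(flow, factor):
    """Average-pool a (2, H, W) flow by `factor` and rescale its magnitudes.

    Assumes H and W are divisible by `factor`.
    """
    _, h, w = flow.shape
    pooled = flow.reshape(2, h // factor, factor, w // factor, factor)
    return pooled.mean(axis=(2, 4)) / factor


def warp_pyramid(features, flow):
    """Warp each feature map with the flow resized to its resolution."""
    full_h = flow.shape[1]
    warped = []
    for feat in features:
        factor = full_h // feat.shape[1]
        scaled = flow if factor == 1 else downscale_flow(flow, factor)
        warped.append(warp_features(feat, scaled))
    return warped


if __name__ == "__main__":
    fine = np.arange(16.0).reshape(1, 4, 4)    # toy full-resolution feature
    coarse = np.arange(4.0).reshape(1, 2, 2)   # toy half-resolution feature
    flow = np.zeros((2, 4, 4))
    flow[0] = 2.0                              # sample 2 pixels to the right
    for level in warp_pyramid([fine, coarse], flow):
        print(level.shape)
```

In a setting like the one the abstract describes, the warped feature maps would serve as guidance for the frozen video diffusion model; how that guidance is injected is not specified here and is left out of the sketch.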

Comments:	ECCV 2024; Project Page: this https URL; Code: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2405.20222 [cs.CV]
	(or arXiv:2405.20222v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2405.20222

Submission history

From: Muyao Niu [view email]
[v1] Thu, 30 May 2024 16:22:22 UTC (23,674 KB)
[v2] Sun, 2 Jun 2024 10:14:56 UTC (23,674 KB)
[v3] Thu, 11 Jul 2024 16:26:03 UTC (24,196 KB)
