A Generalist FaceX via Learning Unified Facial Representation

Han, Yue; Zhang, Jiangning; Zhu, Junwei; Li, Xiangtai; Ge, Yanhao; Li, Wei; Wang, Chengjie; Liu, Yong; Liu, Xiaoming; Tai, Ying

Computer Science > Computer Vision and Pattern Recognition

arXiv:2401.00551 (cs)

[Submitted on 31 Dec 2023]

Title:A Generalist FaceX via Learning Unified Facial Representation

Authors:Yue Han, Jiangning Zhang, Junwei Zhu, Xiangtai Li, Yanhao Ge, Wei Li, Chengjie Wang, Yong Liu, Xiaoming Liu, Ying Tai

View PDF HTML (experimental)

Abstract:This work presents FaceX framework, a novel facial generalist model capable of handling diverse facial tasks simultaneously. To achieve this goal, we initially formulate a unified facial representation for a broad spectrum of facial editing tasks, which macroscopically decomposes a face into fundamental identity, intra-personal variation, and environmental factors. Based on this, we introduce Facial Omni-Representation Decomposing (FORD) for seamless manipulation of various facial components, microscopically decomposing the core aspects of most facial editing tasks. Furthermore, by leveraging the prior of a pretrained StableDiffusion (SD) to enhance generation quality and accelerate training, we design Facial Omni-Representation Steering (FORS) to first assemble unified facial representations and then effectively steer the SD-aware generation process by the efficient Facial Representation Controller (FRC). %Without any additional features, Our versatile FaceX achieves competitive performance compared to elaborate task-specific models on popular facial editing tasks. Full codes and models will be available at this https URL.

Comments:	Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2401.00551 [cs.CV]
	(or arXiv:2401.00551v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2401.00551

Submission history

From: Yue Han [view email]
[v1] Sun, 31 Dec 2023 17:41:48 UTC (30,976 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:A Generalist FaceX via Learning Unified Facial Representation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:A Generalist FaceX via Learning Unified Facial Representation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators