Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation

Jiang, Yuxin; Jiang, Liming; Yang, Shuai; Loy, Chen Change

Computer Science > Computer Vision and Pattern Recognition

arXiv:2308.12968 (cs)

[Submitted on 24 Aug 2023]

Title:Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation

Authors:Yuxin Jiang, Liming Jiang, Shuai Yang, Chen Change Loy

View PDF

Abstract:Automatic high-quality rendering of anime scenes from complex real-world images is of significant practical value. The challenges of this task lie in the complexity of the scenes, the unique features of anime style, and the lack of high-quality datasets to bridge the domain gap. Despite promising attempts, previous efforts are still incompetent in achieving satisfactory results with consistent semantic preservation, evident stylization, and fine details. In this study, we propose Scenimefy, a novel semi-supervised image-to-image translation framework that addresses these challenges. Our approach guides the learning with structure-consistent pseudo paired data, simplifying the pure unsupervised setting. The pseudo data are derived uniquely from a semantic-constrained StyleGAN leveraging rich model priors like CLIP. We further apply segmentation-guided data selection to obtain high-quality pseudo supervision. A patch-wise contrastive style loss is introduced to improve stylization and fine details. Besides, we contribute a high-resolution anime scene dataset to facilitate future research. Our extensive experiments demonstrate the superiority of our method over state-of-the-art baselines in terms of both perceptual quality and quantitative performance.

Comments:	ICCV 2023. The first two authors contributed equally. Code: this https URL Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2308.12968 [cs.CV]
	(or arXiv:2308.12968v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.12968

Submission history

From: Liming Jiang [view email]
[v1] Thu, 24 Aug 2023 17:59:50 UTC (10,844 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators