EpiGRAF: Rethinking training of 3D GANs

Skorokhodov, Ivan; Tulyakov, Sergey; Wang, Yiqun; Wonka, Peter

arXiv:2206.10535 (cs)

[Submitted on 21 Jun 2022 (v1), last revised 15 Dec 2022 (this version, v2)]

Abstract: A very recent trend in generative modeling is building 3D-aware generators from 2D image collections. To induce the 3D bias, such models typically rely on volumetric rendering, which is expensive to employ at high resolutions. Over the past months, more than 10 works have addressed this scaling issue by training a separate 2D decoder to upsample a low-resolution image (or a feature tensor) produced by a pure 3D generator. But this solution comes at a cost: not only does it break multi-view consistency (i.e., shape and texture change when the camera moves), but it also learns the geometry at low fidelity. In this work, we show that it is possible to obtain a high-resolution 3D generator with state-of-the-art image quality by following a completely different route: simply training the model patch-wise. We revisit and improve this optimization scheme in two ways. First, we design a location- and scale-aware discriminator to work on patches of different proportions and spatial positions. Second, we modify the patch sampling strategy based on an annealed beta distribution to stabilize training and accelerate convergence. The resulting model, named EpiGRAF, is an efficient, high-resolution, pure 3D generator, and we test it on four datasets (two introduced in this work) at $256^2$ and $512^2$ resolutions. It obtains state-of-the-art image quality and high-fidelity geometry, and trains ${\approx} 2.5 \times$ faster than its upsampler-based counterparts. Project website: this https URL.
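
To make the idea of patch sampling from an annealed beta distribution more concrete, here is a minimal sketch. It is not the authors' implementation: the function names, the linear schedule, and all numeric values (`beta_start`, `beta_end`, `min_scale`) are illustrative assumptions, since the abstract does not specify them. The sketch draws a patch scale from Beta(1, b), where b grows during training, so early patches tend to cover most of the image and later patches vary more in size.

```python
import random


def beta_at(step, anneal_steps, beta_start=0.001, beta_end=0.8):
    """Hypothetical schedule: linearly anneal the Beta shape parameter."""
    frac = min(max(step / anneal_steps, 0.0), 1.0)
    return beta_start + frac * (beta_end - beta_start)


def sample_patch(step, anneal_steps, min_scale=0.25, rng=random):
    """Return (scale, offset_x, offset_y) of a square patch in unit image coordinates."""
    b = beta_at(step, anneal_steps)
    # Beta(1, b) with a small b puts most of its mass near 1,
    # so patches sampled early in training are close to the full image.
    u = rng.betavariate(1.0, b)
    scale = min_scale + (1.0 - min_scale) * u
    # Place the patch uniformly at random so that it stays inside the image.
    offset_x = rng.uniform(0.0, 1.0 - scale)
    offset_y = rng.uniform(0.0, 1.0 - scale)
    return scale, offset_x, offset_y


if __name__ == "__main__":
    rng = random.Random(0)
    for step in (0, 5000, 10000):
        print(step, sample_patch(step, anneal_steps=10000, rng=rng))
```

Under these assumptions, a location- and scale-aware discriminator would receive the sampled `scale` and offsets alongside the patch, so it can judge patches of different sizes and positions.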

Comments:	NeurIPS 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2206.10535 [cs.CV]
	(or arXiv:2206.10535v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2206.10535

Submission history

From: Ivan Skorokhodov
[v1] Tue, 21 Jun 2022 17:08:23 UTC (14,728 KB)
[v2] Thu, 15 Dec 2022 15:25:28 UTC (18,066 KB)
