VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads

Kupyn, Orest; Khvedchenia, Eugene; Rupprecht, Christian

Computer Science > Computer Vision and Pattern Recognition

arXiv:2407.18245 (cs)

[Submitted on 25 Jul 2024]

Title:VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads

Authors:Orest Kupyn, Eugene Khvedchenia, Christian Rupprecht

View PDF HTML (experimental)

Abstract:Human head detection, keypoint estimation, and 3D head model fitting are important tasks with many applications. However, traditional real-world datasets often suffer from bias, privacy, and ethical concerns, and they have been recorded in laboratory environments, which makes it difficult for trained models to generalize. Here, we introduce VGGHeads -- a large scale synthetic dataset generated with diffusion models for human head detection and 3D mesh estimation. Our dataset comprises over 1 million high-resolution images, each annotated with detailed 3D head meshes, facial landmarks, and bounding boxes. Using this dataset we introduce a new model architecture capable of simultaneous heads detection and head meshes reconstruction from a single image in a single step. Through extensive experimental evaluations, we demonstrate that models trained on our synthetic data achieve strong performance on real images. Furthermore, the versatility of our dataset makes it applicable across a broad spectrum of tasks, offering a general and comprehensive representation of human heads. Additionally, we provide detailed information about the synthetic data generation pipeline, enabling it to be re-used for other tasks and domains.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2407.18245 [cs.CV]
	(or arXiv:2407.18245v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.18245

Submission history

From: Orest Kupyn [view email]
[v1] Thu, 25 Jul 2024 17:58:17 UTC (29,136 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators