Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels

Huang, Rui; Peng, Songyou; Takmaz, Ayca; Tombari, Federico; Pollefeys, Marc; Song, Shiji; Huang, Gao; Engelmann, Francis

Computer Science > Computer Vision and Pattern Recognition

arXiv:2312.17232 (cs)

[Submitted on 28 Dec 2023]

Abstract: Current 3D scene segmentation methods are heavily dependent on manually annotated 3D training datasets. Such manual annotations are labor-intensive, and often lack fine-grained details. Importantly, models trained on this data typically struggle to recognize object classes beyond the annotated classes, i.e., they do not generalize well to unseen domains and require additional domain-specific annotations. In contrast, 2D foundation models demonstrate strong generalization and impressive zero-shot abilities, inspiring us to incorporate these characteristics from 2D models into 3D models. Therefore, we explore the use of image segmentation foundation models to automatically generate training labels for 3D segmentation. We propose Segment3D, a method for class-agnostic 3D scene segmentation that produces high-quality 3D segmentation masks. It improves over existing 3D segmentation models (especially on fine-grained masks), and enables easily adding new training data to further boost the segmentation performance -- all without the need for manual training labels.

Comments:	Project Page: this http URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2312.17232 [cs.CV]
	(or arXiv:2312.17232v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.17232

Submission history

From: Francis Engelmann
[v1] Thu, 28 Dec 2023 18:57:11 UTC (12,194 KB)
