LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes

Marrie, Juliette; Menegaux, Romain; Arbel, Michael; Larlus, Diane; Mairal, Julien

Computer Science > Computer Vision and Pattern Recognition

arXiv:2410.14462 (cs)

[Submitted on 18 Oct 2024 (v1), last revised 28 Jan 2025 (this version, v4)]

Title:LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes

Authors:Juliette Marrie, Romain Menegaux, Michael Arbel, Diane Larlus, Julien Mairal

View PDF HTML (experimental)

Abstract:We address the problem of extending the capabilities of vision foundation models such as DINO, SAM, and CLIP, to 3D tasks. Specifically, we introduce a novel method to uplift 2D image features into Gaussian Splatting representations of 3D scenes. Unlike traditional approaches that rely on minimizing a reconstruction loss, our method employs a simpler and more efficient feature aggregation technique, augmented by a graph diffusion mechanism. Graph diffusion refines 3D features, such as coarse segmentation masks, by leveraging 3D geometry and pairwise similarities induced by DINOv2. Our approach achieves performance comparable to the state of the art on multiple downstream tasks while delivering significant speed-ups. Notably, we obtain competitive segmentation results using generic DINOv2 features, despite DINOv2 not being trained on millions of annotated segmentation masks like SAM. When applied to CLIP features, our method demonstrates strong performance in open-vocabulary object localization tasks, highlighting the versatility of our approach.

Comments:	Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2410.14462 [cs.CV]
	(or arXiv:2410.14462v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2410.14462

Submission history

From: Juliette Marrie [view email]
[v1] Fri, 18 Oct 2024 13:44:29 UTC (46,115 KB)
[v2] Thu, 5 Dec 2024 10:34:11 UTC (28,112 KB)
[v3] Fri, 6 Dec 2024 15:39:13 UTC (28,111 KB)
[v4] Tue, 28 Jan 2025 18:35:41 UTC (27,852 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators