Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection

Chang, Gyusam; Lee, Jiwon; Kim, Donghyun; Kim, Jinkyu; Lee, Dongwook; Ji, Daehyun; Jang, Sujin; Kim, Sangpil

Computer Science > Computer Vision and Pattern Recognition

arXiv:2410.22461 (cs)

[Submitted on 29 Oct 2024]

Title:Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection

Authors:Gyusam Chang, Jiwon Lee, Donghyun Kim, Jinkyu Kim, Dongwook Lee, Daehyun Ji, Sujin Jang, Sangpil Kim

View PDF HTML (experimental)

Abstract:Recent advances in 3D object detection leveraging multi-view cameras have demonstrated their practical and economical value in various challenging vision tasks. However, typical supervised learning approaches face challenges in achieving satisfactory adaptation toward unseen and unlabeled target datasets (\ie, direct transfer) due to the inevitable geometric misalignment between the source and target domains. In practice, we also encounter constraints on resources for training models and collecting annotations for the successful deployment of 3D object detectors. In this paper, we propose Unified Domain Generalization and Adaptation (UDGA), a practical solution to mitigate those drawbacks. We first propose Multi-view Overlap Depth Constraint that leverages the strong association between multi-view, significantly alleviating geometric gaps due to perspective view changes. Then, we present a Label-Efficient Domain Adaptation approach to handle unfamiliar targets with significantly fewer amounts of labels (\ie, 1$\%$ and 5$\%)$, while preserving well-defined source knowledge for training efficiency. Overall, UDGA framework enables stable detection performance in both source and target domains, effectively bridging inevitable domain gaps, while demanding fewer annotations. We demonstrate the robustness of UDGA with large-scale benchmarks: nuScenes, Lyft, and Waymo, where our framework outperforms the current state-of-the-art methods.

Comments:	Accepted to NeurIPS 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2410.22461 [cs.CV]
	(or arXiv:2410.22461v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2410.22461

Submission history

From: Gyusam Chang [view email]
[v1] Tue, 29 Oct 2024 18:51:49 UTC (3,203 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators