MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection

Yang, Yuxue; Fan, Lue; Zhang, Zhaoxiang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2401.16305 (cs)

[Submitted on 29 Jan 2024]

Title:MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection

Authors:Yuxue Yang, Lue Fan, Zhaoxiang Zhang

View PDF

Abstract:Label-efficient LiDAR-based 3D object detection is currently dominated by weakly/semi-supervised methods. Instead of exclusively following one of them, we propose MixSup, a more practical paradigm simultaneously utilizing massive cheap coarse labels and a limited number of accurate labels for Mixed-grained Supervision. We start by observing that point clouds are usually textureless, making it hard to learn semantics. However, point clouds are geometrically rich and scale-invariant to the distances from sensors, making it relatively easy to learn the geometry of objects, such as poses and shapes. Thus, MixSup leverages massive coarse cluster-level labels to learn semantics and a few expensive box-level labels to learn accurate poses and shapes. We redesign the label assignment in mainstream detectors, which allows them seamlessly integrated into MixSup, enabling practicality and universality. We validate its effectiveness in nuScenes, Waymo Open Dataset, and KITTI, employing various detectors. MixSup achieves up to 97.31% of fully supervised performance, using cheap cluster annotations and only 10% box annotations. Furthermore, we propose PointSAM based on the Segment Anything Model for automated coarse labeling, further reducing the annotation burden. The code is available at this https URL.

Comments:	ICLR 2024, code is available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2401.16305 [cs.CV]
	(or arXiv:2401.16305v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2401.16305

Submission history

From: Yuxue Yang [view email]
[v1] Mon, 29 Jan 2024 17:05:19 UTC (6,103 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators