Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks

Zhu, Xiangyang; Zhang, Renrui; He, Bowei; Guo, Ziyu; Liu, Jiaming; Dong, Hao; Gao, Peng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2308.12961 (cs)

[Submitted on 24 Aug 2023]

Title:Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks

Authors:Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Hao Dong, Peng Gao

View PDF

Abstract:To reduce the reliance on large-scale datasets, recent works in 3D segmentation resort to few-shot learning. Current 3D few-shot semantic segmentation methods first pre-train the models on `seen' classes, and then evaluate their generalization performance on `unseen' classes. However, the prior pre-training stage not only introduces excessive time overhead, but also incurs a significant domain gap on `unseen' classes. To tackle these issues, we propose an efficient Training-free Few-shot 3D Segmentation netwrok, TFS3D, and a further training-based variant, TFS3D-T. Without any learnable parameters, TFS3D extracts dense representations by trigonometric positional encodings, and achieves comparable performance to previous training-based methods. Due to the elimination of pre-training, TFS3D can alleviate the domain gap issue and save a substantial amount of time. Building upon TFS3D, TFS3D-T only requires to train a lightweight query-support transferring attention (QUEST), which enhances the interaction between the few-shot query and support data. Experiments demonstrate TFS3D-T improves previous state-of-the-art methods by +6.93% and +17.96% mIoU respectively on S3DIS and ScanNet, while reducing the training time by -90%, indicating superior effectiveness and efficiency.

Comments:	Code is available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2308.12961 [cs.CV]
	(or arXiv:2308.12961v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.12961

Submission history

From: Xiangyang Zhu [view email]
[v1] Thu, 24 Aug 2023 17:58:03 UTC (5,997 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators