Unseen Object Instance Segmentation with Fully Test-time RGB-D Embeddings Adaptation

Zhang, Lu; Zhang, Siqi; Yang, Xu; Qiao, Hong; Liu, Zhiyong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2204.09847 (cs)

[Submitted on 21 Apr 2022 (v1), last revised 22 Feb 2023 (this version, v2)]

Title:Unseen Object Instance Segmentation with Fully Test-time RGB-D Embeddings Adaptation

Authors:Lu Zhang, Siqi Zhang, Xu Yang, Hong Qiao, Zhiyong Liu

View PDF

Abstract:Segmenting unseen objects is a crucial ability for the robot since it may encounter new environments during the operation. Recently, a popular solution is leveraging RGB-D features of large-scale synthetic data and directly applying the model to unseen real-world scenarios. However, the domain shift caused by the sim2real gap is inevitable, posing a crucial challenge to the segmentation model. In this paper, we emphasize the adaptation process across sim2real domains and model it as a learning problem on the BatchNorm parameters of a simulation-trained model. Specifically, we propose a novel non-parametric entropy objective, which formulates the learning objective for the test-time adaptation in an open-world manner. Then, a cross-modality knowledge distillation objective is further designed to encourage the test-time knowledge transfer for feature enhancement. Our approach can be efficiently implemented with only test images, without requiring annotations or revisiting the large-scale synthetic training data. Besides significant time savings, the proposed method consistently improves segmentation results on the overlap and boundary metrics, achieving state-of-the-art performance on unseen object instance segmentation.

Comments:	Accepted to ICRA 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2204.09847 [cs.CV]
	(or arXiv:2204.09847v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2204.09847

Submission history

From: Lu Zhang [view email]
[v1] Thu, 21 Apr 2022 02:35:20 UTC (1,507 KB)
[v2] Wed, 22 Feb 2023 05:33:23 UTC (1,779 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Unseen Object Instance Segmentation with Fully Test-time RGB-D Embeddings Adaptation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Unseen Object Instance Segmentation with Fully Test-time RGB-D Embeddings Adaptation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators