A General Visual Representation Guided Framework with Global Affinity for Weakly Supervised Salient Object Detection

Xu, Binwei; Liang, Haoran; Gong, Weihua; Liang, Ronghua; Chen, Peng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2302.10697v1 (cs)

[Submitted on 21 Feb 2023 (this version), latest version 9 Jun 2023 (v2)]

Title:A General Visual Representation Guided Framework with Global Affinity for Weakly Supervised Salient Object Detection

Authors:Binwei Xu, Haoran Liang, Weihua Gong, Ronghua Liang, Peng Chen

View PDF

Abstract:Fully supervised salient object detection (SOD) methods have made considerable progress in performance, yet these models rely heavily on expensive pixel-wise labels. Recently, to achieve a trade-off between labeling burden and performance, scribble-based SOD methods have attracted increasing attention. Previous models directly implement the SOD task only based on small-scale SOD training data. Due to the limited information provided by the weakly scribble tags and such small-scale training data, it is extremely difficult for them to understand the image and further achieve a superior SOD task. In this paper, we propose a simple yet effective framework guided by general visual representations that simulate the general cognition of humans for scribble-based SOD. It consists of a task-related encoder, a general visual module, and an information integration module to combine efficiently the general visual representations learned from large-scale unlabeled datasets with task-related features to perform the SOD task based on understanding the contextual connections of images. Meanwhile, we propose a novel global semantic affinity loss to guide the model to perceive the global structure of the salient objects. Experimental results on five public benchmark datasets demonstrate that our method that only utilizes scribble annotations without introducing any extra label outperforms the state-of-the-art weakly supervised SOD methods and is comparable or even superior to the state-of-the-art fully supervised models.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2302.10697 [cs.CV]
	(or arXiv:2302.10697v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2302.10697

Submission history

From: Binwei Xu [view email]
[v1] Tue, 21 Feb 2023 14:31:57 UTC (2,906 KB)
[v2] Fri, 9 Jun 2023 01:30:00 UTC (5,471 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:A General Visual Representation Guided Framework with Global Affinity for Weakly Supervised Salient Object Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:A General Visual Representation Guided Framework with Global Affinity for Weakly Supervised Salient Object Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators