Transductive Zero-Shot Learning with Visual Structure Constraint

Wan, Ziyu; Chen, Dongdong; Li, Yan; Yan, Xingguang; Zhang, Junge; Yu, Yizhou; Liao, Jing

Computer Science > Computer Vision and Pattern Recognition

arXiv:1901.01570 (cs)

[Submitted on 6 Jan 2019 (v1), last revised 5 Jan 2020 (this version, v2)]

Abstract: To recognize objects of unseen classes, most existing Zero-Shot Learning (ZSL) methods first learn a compatible projection function between the common semantic space and the visual space based on data from the source seen classes, then apply it directly to the target unseen classes. In real scenarios, however, the data distributions of the source and target domains might not match well, which causes the well-known domain shift problem. Based on the observation that the visual features of test instances can be separated into different clusters, we propose a new visual structure constraint on class centers for transductive ZSL to improve the generality of the projection function (i.e., to alleviate the domain shift problem). Specifically, three different strategies (symmetric Chamfer distance, bipartite matching distance, and Wasserstein distance) are adopted to align the projected unseen semantic centers with the visual cluster centers of test instances. We also propose a new training strategy to handle real cases where many unrelated images exist in the test dataset, a situation not considered by previous methods. Experiments on many widely used datasets demonstrate that the proposed visual structure constraint consistently brings substantial performance gains and achieves state-of-the-art results. The source code is available at this https URL.
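As a rough illustration of two of the alignment strategies named in the abstract, the sketch below computes a symmetric Chamfer distance and a bipartite matching distance between a set of projected semantic centers and a set of visual cluster centers. It is not the authors' implementation: the function names, the use of squared Euclidean distance, the toy 2-D points, and the brute-force search over assignments are all assumptions chosen for readability. The Wasserstein variant is omitted.

```python
from itertools import permutations


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def symmetric_chamfer(projected, clusters):
    """Sum of nearest-neighbour squared distances in both directions."""
    forward = sum(min(sq_dist(p, c) for c in clusters) for p in projected)
    backward = sum(min(sq_dist(c, p) for p in projected) for c in clusters)
    return forward + backward


def bipartite_matching(projected, clusters):
    """Minimum total squared distance over all one-to-one assignments.

    Brute force over permutations, so only suitable for tiny examples;
    a practical version would use an assignment solver instead.
    """
    if len(projected) != len(clusters):
        raise ValueError("both sets must contain the same number of centers")
    return min(
        sum(sq_dist(p, clusters[j]) for p, j in zip(projected, perm))
        for perm in permutations(range(len(clusters)))
    )


# Hypothetical toy data: two projected semantic centers, two cluster centers.
projected = [(0.0, 0.0), (1.0, 0.0)]
clusters = [(0.0, 0.0), (3.0, 0.0)]

chamfer_value = symmetric_chamfer(projected, clusters)
matching_value = bipartite_matching(projected, clusters)
print(chamfer_value, matching_value)
```

In this toy case the Chamfer distance lets both projected centers pick the same nearest cluster center, while the bipartite matching distance forces a one-to-one pairing, which is the structural difference between the two strategies.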

Comments:	NeurIPS 2019, code available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1901.01570 [cs.CV]
	(or arXiv:1901.01570v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1901.01570

Submission history

From: Ziyu Wan
[v1] Sun, 6 Jan 2019 16:43:07 UTC (4,526 KB)
[v2] Sun, 5 Jan 2020 08:23:51 UTC (570 KB)
