Mutual Information Guided Optimal Transport for Unsupervised Visible-Infrared Person Re-identification

Zhang, Zhizhong; Wang, Jiangming; Tan, Xin; Qu, Yanyun; Wang, Junping; Xie, Yong; Xie, Yuan

arXiv:2407.12758 (cs)

[Submitted on 17 Jul 2024]

Abstract: Unsupervised visible-infrared person re-identification (USVI-ReID) is a challenging task that aims to retrieve pedestrian images across modalities without using any label information. The large cross-modality variance makes it difficult to generate reliable cross-modality labels, and the lack of annotations makes it harder still to learn modality-invariant features. In this paper, we first derive an optimization objective for unsupervised VI-ReID based on the mutual information between the model's cross-modality input and output. Through an equivalent derivation, we obtain three learning principles: "Sharpness" (entropy minimization), "Fairness" (uniform label distribution), and "Fitness" (reliable cross-modality matching). Guided by these principles, we design an iterative training strategy that alternates between model training and cross-modality matching. In the matching stage, we propose a uniform-prior-guided optimal transport assignment ("Fitness", "Fairness") to select matched visible and infrared prototypes. In the training stage, we use this matching information in prototype-based contrastive learning to minimize the intra- and cross-modality entropy ("Sharpness"). Extensive experiments on benchmarks demonstrate the effectiveness of our method, e.g., Rank-1 accuracy of 60.6% on SYSU-MM01 and 90.3% on RegDB without any annotations.
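
To make the matching stage concrete, the following is a minimal sketch of an optimal transport assignment with uniform marginals, computed with generic Sinkhorn-Knopp iterations. It is not the authors' implementation: the function names (sinkhorn_uniform, cosine_cost), the cosine cost, the regularization strength eps, the iteration count, and the toy 2-D prototypes are all assumptions chosen for illustration. The uniform row and column marginals stand in for the "Fairness" idea of a uniform label distribution, and the low-cost pairing stands in for "Fitness".

```python
import math

def sinkhorn_uniform(cost, eps=0.1, n_iters=200):
    """Entropy-regularized transport plan with uniform marginals.

    cost[i][j] is the (hypothetical) cost of matching visible prototype i
    to infrared prototype j. Every row is asked to carry mass 1/n and
    every column mass 1/m.
    """
    n, m = len(cost), len(cost[0])
    r, c = 1.0 / n, 1.0 / m
    # Gibbs kernel: low cost gives high affinity.
    K = [[math.exp(-cost[i][j] / eps) for j in range(m)] for i in range(n)]
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(n_iters):
        # Rescale rows so each row of diag(u) K diag(v) sums to r.
        u = [r / sum(K[i][j] * v[j] for j in range(m)) for i in range(n)]
        # Rescale columns so each column sums to c.
        v = [c / sum(K[i][j] * u[i] for i in range(n)) for j in range(m)]
    return [[u[i] * K[i][j] * v[j] for j in range(m)] for i in range(n)]

def cosine_cost(a, b):
    """1 - cosine similarity between two feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)

# Toy prototypes (made-up 2-D features, one per cluster in each modality).
visible = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
infrared = [[0.1, 0.9], [0.9, 0.1], [0.6, 0.8]]

cost = [[cosine_cost(p, q) for q in infrared] for p in visible]
plan = sinkhorn_uniform(cost)

# For each visible prototype, pick the infrared prototype with most mass.
matches = [max(range(len(row)), key=row.__getitem__) for row in plan]
print(matches)
```

Because every column must receive the same total mass, no single infrared prototype can absorb all visible prototypes, which a plain nearest-neighbor assignment would allow. In this toy example each visible prototype ends up paired with a distinct infrared prototype.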

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.12758 [cs.CV]
	(or arXiv:2407.12758v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.12758

Submission history

From: Jiangming Wang
[v1] Wed, 17 Jul 2024 17:32:07 UTC (25,947 KB)
