DPStyler: Dynamic PromptStyler for Source-Free Domain Generalization

Tang, Yunlong; Wan, Yuxuan; Qi, Lei; Geng, Xin

arXiv:2403.16697 (cs)

[Submitted on 25 Mar 2024 (v1), last revised 14 Jul 2024 (this version, v2)]

Abstract: Source-Free Domain Generalization (SFDG) aims to develop a model that works for unseen target domains without relying on any source domain. Research in SFDG primarily builds upon the existing knowledge of large-scale vision-language models and uses the pre-trained model's joint vision-language space to simulate style transfer across domains, thus eliminating the dependency on source domain images. However, two questions merit further work: how to efficiently simulate rich and diverse styles using text prompts, and how to extract domain-invariant information useful for classification from encoder output features that contain both semantic and style information. In this paper, we introduce Dynamic PromptStyler (DPStyler), comprising Style Generation and Style Removal modules to address these issues. The Style Generation module refreshes all styles at every training epoch, while the Style Removal module eliminates variations in the encoder's output features caused by input styles. Moreover, because the Style Generation module, which generates style word vectors by random sampling or style mixing, makes the model sensitive to input text prompts, we introduce a model ensemble method to mitigate this sensitivity. Extensive experiments demonstrate that our framework outperforms state-of-the-art methods on benchmark datasets.
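The per-epoch style refresh described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the Gaussian sampling distribution, the convex-combination form of style mixing, and every name and parameter below (`sample_style`, `mix_styles`, `refresh_styles`, `std`, `mix_prob`) are assumptions made for illustration only.

```python
import random


def sample_style(dim, rng, std=0.02):
    """Draw one style word vector from a zero-mean Gaussian (assumed distribution)."""
    return [rng.gauss(0.0, std) for _ in range(dim)]


def mix_styles(a, b, lam):
    """Convex combination of two style vectors (one assumed form of style mixing)."""
    return [lam * x + (1.0 - lam) * y for x, y in zip(a, b)]


def refresh_styles(num_styles, dim, rng, mix_prob=0.5):
    """Build a completely new style bank; this sketch calls it once per epoch."""
    base = [sample_style(dim, rng) for _ in range(num_styles)]
    styles = []
    for s in base:
        if rng.random() < mix_prob:
            # Mix with another randomly chosen style from the fresh bank.
            other = rng.choice(base)
            styles.append(mix_styles(s, other, rng.random()))
        else:
            # Keep the randomly sampled style unchanged.
            styles.append(s)
    return styles


rng = random.Random(0)
for epoch in range(3):
    # Every style is regenerated at the start of each epoch.
    styles = refresh_styles(num_styles=4, dim=8, rng=rng)
    # A training step would consume `styles` here (omitted in this sketch).
```

In this sketch nothing is carried over between epochs, which mirrors the statement that all styles are refreshed at every training epoch; how the paper actually parameterizes sampling and mixing is not stated in the abstract.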

Comments:	Accepted by IEEE TMM
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.16697 [cs.CV]
	(or arXiv:2403.16697v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.16697

Submission history

From: Yunlong Tang
[v1] Mon, 25 Mar 2024 12:31:01 UTC (2,931 KB)
[v2] Sun, 14 Jul 2024 13:27:42 UTC (4,844 KB)
