Debiasing Vision-Language Models via Biased Prompts

Chuang, Ching-Yao; Jampani, Varun; Li, Yuanzhen; Torralba, Antonio; Jegelka, Stefanie

Computer Science > Machine Learning

arXiv:2302.00070 (cs)

[Submitted on 31 Jan 2023 (v1), last revised 15 May 2023 (this version, v2)]

Title:Debiasing Vision-Language Models via Biased Prompts

Authors:Ching-Yao Chuang, Varun Jampani, Yuanzhen Li, Antonio Torralba, Stefanie Jegelka

View PDF

Abstract:Machine learning models have been shown to inherit biases from their training datasets. This can be particularly problematic for vision-language foundation models trained on uncurated datasets scraped from the internet. The biases can be amplified and propagated to downstream applications like zero-shot classifiers and text-to-image generative models. In this study, we propose a general approach for debiasing vision-language foundation models by projecting out biased directions in the text embedding. In particular, we show that debiasing only the text embedding with a calibrated projection matrix suffices to yield robust classifiers and fair generative models. The proposed closed-form solution enables easy integration into large-scale pipelines, and empirical results demonstrate that our approach effectively reduces social bias and spurious correlation in both discriminative and generative vision-language models without the need for additional data or training.

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2302.00070 [cs.LG]
	(or arXiv:2302.00070v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2302.00070

Submission history

From: Ching-Yao Chuang [view email]
[v1] Tue, 31 Jan 2023 20:09:33 UTC (10,298 KB)
[v2] Mon, 15 May 2023 07:51:14 UTC (10,614 KB)

Computer Science > Machine Learning

Title:Debiasing Vision-Language Models via Biased Prompts

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Debiasing Vision-Language Models via Biased Prompts

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators