Detecting Emergent Intersectional Biases: Contextualized Word Embeddings Contain a Distribution of Human-like Biases

Guo, Wei; Caliskan, Aylin

doi:10.1145/3461702.3462536

Computer Science > Computers and Society

arXiv:2006.03955 (cs)

[Submitted on 6 Jun 2020 (v1), last revised 19 May 2021 (this version, v5)]

Title:Detecting Emergent Intersectional Biases: Contextualized Word Embeddings Contain a Distribution of Human-like Biases

Authors:Wei Guo, Aylin Caliskan

View PDF

Abstract:With the starting point that implicit human biases are reflected in the statistical regularities of language, it is possible to measure biases in English static word embeddings. State-of-the-art neural language models generate dynamic word embeddings dependent on the context in which the word appears. Current methods measure pre-defined social and intersectional biases that appear in particular contexts defined by sentence templates. Dispensing with templates, we introduce the Contextualized Embedding Association Test (CEAT), that can summarize the magnitude of overall bias in neural language models by incorporating a random-effects model. Experiments on social and intersectional biases show that CEAT finds evidence of all tested biases and provides comprehensive information on the variance of effect magnitudes of the same bias in different contexts. All the models trained on English corpora that we study contain biased representations.
Furthermore, we develop two methods, Intersectional Bias Detection (IBD) and Emergent Intersectional Bias Detection (EIBD), to automatically identify the intersectional biases and emergent intersectional biases from static word embeddings in addition to measuring them in contextualized word embeddings. We present the first algorithmic bias detection findings on how intersectional group members are strongly associated with unique emergent biases that do not overlap with the biases of their constituent minority identities. IBD and EIBD achieve high accuracy when detecting the intersectional and emergent biases of African American females and Mexican American females. Our results indicate that biases at the intersection of race and gender associated with members of multiple minority groups, such as African American females and Mexican American females, have the highest magnitude across all neural language models.

Comments:	19 pages, 2 figures, 4 tables
Subjects:	Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2006.03955 [cs.CY]
	(or arXiv:2006.03955v5 [cs.CY] for this version)
	https://doi.org/10.48550/arXiv.2006.03955
Journal reference:	AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society 2021
Related DOI:	https://doi.org/10.1145/3461702.3462536

Submission history

From: Aylin Caliskan [view email]
[v1] Sat, 6 Jun 2020 19:49:50 UTC (358 KB)
[v2] Mon, 22 Jun 2020 20:08:41 UTC (69 KB)
[v3] Mon, 6 Jul 2020 18:43:34 UTC (71 KB)
[v4] Fri, 16 Apr 2021 01:45:35 UTC (2,503 KB)
[v5] Wed, 19 May 2021 15:06:28 UTC (2,504 KB)

Computer Science > Computers and Society

Title:Detecting Emergent Intersectional Biases: Contextualized Word Embeddings Contain a Distribution of Human-like Biases

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computers and Society

Title:Detecting Emergent Intersectional Biases: Contextualized Word Embeddings Contain a Distribution of Human-like Biases

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators