Designing Disaggregated Evaluations of AI Systems: Choices, Considerations, and Tradeoffs

Barocas, Solon; Guo, Anhong; Kamar, Ece; Krones, Jacquelyn; Morris, Meredith Ringel; Vaughan, Jennifer Wortman; Wadsworth, Duncan; Wallach, Hanna

arXiv:2103.06076 (cs)

[Submitted on 10 Mar 2021 (v1), last revised 1 Dec 2021 (this version, v2)]

Abstract: Disaggregated evaluations of AI systems, in which system performance is assessed and reported separately for different groups of people, are conceptually simple. However, their design involves a variety of choices. Some of these choices influence the results that will be obtained, and thus the conclusions that can be drawn; others influence the impacts -- both beneficial and harmful -- that a disaggregated evaluation will have on people, including the people whose data is used to conduct the evaluation. We argue that a deeper understanding of these choices will enable researchers and practitioners to design careful and conclusive disaggregated evaluations. We also argue that better documentation of these choices, along with the underlying considerations and tradeoffs that have been made, will help others when interpreting an evaluation's results and conclusions.
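
To make the core idea concrete, the following is a minimal sketch of a disaggregated evaluation: one metric (accuracy here) is computed separately for each group and reported next to the group's sample size. It is an illustration only and is not taken from the paper. The function name `disaggregated_accuracy`, the group labels "A" and "B", and the toy data are all hypothetical, and the choice of accuracy and of a single grouping attribute are assumptions of exactly the kind the abstract says should be documented.

```python
from collections import defaultdict


def disaggregated_accuracy(y_true, y_pred, groups):
    """Return (overall accuracy, {group: (accuracy, sample count)}).

    Hypothetical helper for illustration; accuracy is an assumed metric.
    """
    if not (len(y_true) == len(y_pred) == len(groups)):
        raise ValueError("y_true, y_pred, and groups must have equal length")

    correct = defaultdict(int)  # correct predictions per group
    total = defaultdict(int)    # examples per group
    for truth, pred, group in zip(y_true, y_pred, groups):
        total[group] += 1
        correct[group] += int(truth == pred)

    # Report each group's accuracy together with its sample size,
    # since small groups yield noisy estimates.
    per_group = {g: (correct[g] / total[g], total[g]) for g in total}
    overall = sum(correct.values()) / len(y_true) if y_true else float("nan")
    return overall, per_group


# Hypothetical toy data: labels, predictions, and a group per example.
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 1, 1, 0, 0, 1, 0]
groups = ["A", "A", "A", "A", "A", "B", "B", "B"]

overall, per_group = disaggregated_accuracy(y_true, y_pred, groups)
print(f"overall accuracy: {overall:.2f}")
for group, (accuracy, count) in sorted(per_group.items()):
    print(f"group {group}: accuracy={accuracy:.2f} (n={count})")
```

In this toy data the aggregate accuracy hides a gap between the two groups, and the smaller group has only three examples, so its estimate is much less reliable. Which metric to use, how groups are defined, and how many examples each group needs are design choices rather than fixed parts of the method.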

Subjects:	Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2103.06076 [cs.CY]
	(or arXiv:2103.06076v2 [cs.CY] for this version)
	https://doi.org/10.48550/arXiv.2103.06076

Submission history

From: Hanna Wallach
[v1] Wed, 10 Mar 2021 14:26:14 UTC (67 KB)
[v2] Wed, 1 Dec 2021 20:38:18 UTC (943 KB)
