Towards Measuring Fairness in AI: the Casual Conversations Dataset

Hazirbas, Caner; Bitton, Joanna; Dolhansky, Brian; Pan, Jacqueline; Gordo, Albert; Ferrer, Cristian Canton

arXiv:2104.02821 (cs)

[Submitted on 6 Apr 2021 (v1), last revised 3 Nov 2021 (this version, v2)]

Abstract: This paper introduces a novel dataset to help researchers evaluate their computer vision and audio models for accuracy across a diverse set of ages, genders, apparent skin tones and ambient lighting conditions. Our dataset is composed of 3,011 subjects and contains over 45,000 videos, with an average of 15 videos per person. The videos were recorded in multiple U.S. states with a diverse set of adults in various age, gender and apparent skin tone groups. A key feature is that each subject agreed to participate and consented to the use of their likeness. Additionally, our age and gender annotations are provided by the subjects themselves. A group of trained annotators labeled the subjects' apparent skin tone using the Fitzpatrick skin type scale. Moreover, annotations for videos recorded in low ambient lighting are also provided. As an application to measure robustness of predictions across certain attributes, we provide a comprehensive study on the top five winners of the DeepFake Detection Challenge (DFDC). Experimental evaluation shows that the winning models are less performant on some specific groups of people, such as subjects with darker skin tones, and thus may not generalize to all people. In addition, we also evaluate state-of-the-art apparent age and gender classification methods. Our experiments provide a thorough analysis of these models in terms of fair treatment of people from various backgrounds.
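
The abstract describes measuring how a model's performance varies across annotated attributes. A minimal sketch of that kind of disaggregated evaluation is shown below. It is not the authors' evaluation code: the record layout, the field names (`skin_type`, `label`, `prediction`), the pairing of Fitzpatrick types into `I-II` and `V-VI` groups, and the toy values are all assumptions made for illustration.

```python
from collections import defaultdict


def accuracy_by_group(records, attribute):
    """Return {group_value: accuracy} for one annotated attribute."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for rec in records:
        group = rec[attribute]
        total[group] += 1
        if rec["prediction"] == rec["label"]:
            correct[group] += 1
    return {group: correct[group] / total[group] for group in total}


def accuracy_gap(per_group):
    """Largest difference in accuracy between any two groups."""
    values = list(per_group.values())
    return max(values) - min(values)


# Toy, made-up records (hypothetical; not taken from the dataset or the paper).
records = [
    {"skin_type": "I-II", "label": 1, "prediction": 1},
    {"skin_type": "I-II", "label": 0, "prediction": 0},
    {"skin_type": "V-VI", "label": 1, "prediction": 0},
    {"skin_type": "V-VI", "label": 0, "prediction": 0},
]

per_group = accuracy_by_group(records, "skin_type")
gap = accuracy_gap(per_group)
```

Reporting the per-group values alongside the gap, rather than a single overall accuracy, is what makes a disparity between groups visible.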

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2104.02821 [cs.CV]
	(or arXiv:2104.02821v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2104.02821

Submission history

From: Caner Hazirbas
[v1] Tue, 6 Apr 2021 22:48:22 UTC (8,851 KB)
[v2] Wed, 3 Nov 2021 20:49:28 UTC (19,101 KB)
