Learning from Discriminatory Training Data

Grabowicz, Przemyslaw A.; Perello, Nicholas; Takatsu, Kenta

Computer Science > Machine Learning

arXiv:1912.08189 (cs)

[Submitted on 17 Dec 2019 (v1), last revised 21 Apr 2023 (this version, v4)]

Title:Learning from Discriminatory Training Data

Authors:Przemyslaw A. Grabowicz, Nicholas Perello, Kenta Takatsu

View PDF

Abstract:Supervised learning systems are trained using historical data and, if the data was tainted by discrimination, they may unintentionally learn to discriminate against protected groups. We propose that fair learning methods, despite training on potentially discriminatory datasets, shall perform well on fair test datasets. Such dataset shifts crystallize application scenarios for specific fair learning methods. For instance, the removal of direct discrimination can be represented as a particular dataset shift problem. For this scenario, we propose a learning method that provably minimizes model error on fair datasets, while blindly training on datasets poisoned with direct additive discrimination. The method is compatible with existing legal systems and provides a solution to the widely discussed issue of protected groups' intersectionality by striking a balance between the protected groups. Technically, the method applies probabilistic interventions, has causal and counterfactual formulations, and is computationally lightweight - it can be used with any supervised learning model to prevent discrimination via proxies while maximizing model accuracy for business necessity.

Comments:	16 pages, 14 figures, 1 table
Subjects:	Machine Learning (cs.LG); Computers and Society (cs.CY); Physics and Society (physics.soc-ph)
ACM classes:	I.2.6; K.4.1
Cite as:	arXiv:1912.08189 [cs.LG]
	(or arXiv:1912.08189v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1912.08189

Submission history

From: Nicholas Perello [view email]
[v1] Tue, 17 Dec 2019 18:53:23 UTC (3,287 KB)
[v2] Mon, 24 Feb 2020 18:50:51 UTC (2,710 KB)
[v3] Tue, 23 Feb 2021 02:40:29 UTC (7,934 KB)
[v4] Fri, 21 Apr 2023 02:31:12 UTC (3,960 KB)

Computer Science > Machine Learning

Title:Learning from Discriminatory Training Data

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning from Discriminatory Training Data

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators