Certifying the Fairness of KNN in the Presence of Dataset Bias

Li, Yannan; Wang, Jingbo; Wang, Chao

Computer Science > Machine Learning

arXiv:2307.08722 (cs)

[Submitted on 17 Jul 2023]

Title:Certifying the Fairness of KNN in the Presence of Dataset Bias

Authors:Yannan Li, Jingbo Wang, Chao Wang

View PDF

Abstract:We propose a method for certifying the fairness of the classification result of a widely used supervised learning algorithm, the k-nearest neighbors (KNN), under the assumption that the training data may have historical bias caused by systematic mislabeling of samples from a protected minority group. To the best of our knowledge, this is the first certification method for KNN based on three variants of the fairness definition: individual fairness, $\epsilon$-fairness, and label-flipping fairness. We first define the fairness certification problem for KNN and then propose sound approximations of the complex arithmetic computations used in the state-of-the-art KNN algorithm. This is meant to lift the computation results from the concrete domain to an abstract domain, to reduce the computational cost. We show effectiveness of this abstract interpretation based technique through experimental evaluation on six datasets widely used in the fairness research literature. We also show that the method is accurate enough to obtain fairness certifications for a large number of test inputs, despite the presence of historical bias in the datasets.

Subjects:	Machine Learning (cs.LG); Computers and Society (cs.CY); Software Engineering (cs.SE)
Cite as:	arXiv:2307.08722 [cs.LG]
	(or arXiv:2307.08722v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2307.08722

Submission history

From: Yannan Li [view email]
[v1] Mon, 17 Jul 2023 07:09:55 UTC (598 KB)

Computer Science > Machine Learning

Title:Certifying the Fairness of KNN in the Presence of Dataset Bias

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Certifying the Fairness of KNN in the Presence of Dataset Bias

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators