Integrating Prior Knowledge in Post-hoc Explanations

Jeyasothy, Adulam; Laugel, Thibault; Lesot, Marie-Jeanne; Marsala, Christophe; Detyniecki, Marcin

Computer Science > Artificial Intelligence

arXiv:2204.11634 (cs)

[Submitted on 25 Apr 2022]

Title:Integrating Prior Knowledge in Post-hoc Explanations

Authors:Adulam Jeyasothy, Thibault Laugel, Marie-Jeanne Lesot, Christophe Marsala, Marcin Detyniecki

View PDF

Abstract:In the field of eXplainable Artificial Intelligence (XAI), post-hoc interpretability methods aim at explaining to a user the predictions of a trained decision model. Integrating prior knowledge into such interpretability methods aims at improving the explanation understandability and allowing for personalised explanations adapted to each user. In this paper, we propose to define a cost function that explicitly integrates prior knowledge into the interpretability objectives: we present a general framework for the optimization problem of post-hoc interpretability methods, and show that user knowledge can thus be integrated to any method by adding a compatibility term in the cost function. We instantiate the proposed formalization in the case of counterfactual explanations and propose a new interpretability method called Knowledge Integration in Counterfactual Explanation (KICE) to optimize it. The paper performs an experimental study on several benchmark data sets to characterize the counterfactual instances generated by KICE, as compared to reference methods.

Comments:	preprint
Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2204.11634 [cs.AI]
	(or arXiv:2204.11634v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2204.11634

Submission history

From: Adulam Jeyasothy [view email]
[v1] Mon, 25 Apr 2022 13:09:53 UTC (148 KB)

Computer Science > Artificial Intelligence

Title:Integrating Prior Knowledge in Post-hoc Explanations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Integrating Prior Knowledge in Post-hoc Explanations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators