Towards Rigorous Interpretations: a Formalisation of Feature Attribution

Afchar, Darius; Hennequin, Romain; Guigue, Vincent

Computer Science > Machine Learning

arXiv:2104.12437 (cs)

[Submitted on 26 Apr 2021 (v1), last revised 5 Jul 2021 (this version, v2)]

Title:Towards Rigorous Interpretations: a Formalisation of Feature Attribution

Authors:Darius Afchar, Romain Hennequin, Vincent Guigue

View PDF

Abstract:Feature attribution is often loosely presented as the process of selecting a subset of relevant features as a rationale of a prediction. Task-dependent by nature, precise definitions of "relevance" encountered in the literature are however not always consistent. This lack of clarity stems from the fact that we usually do not have access to any notion of ground-truth attribution and from a more general debate on what good interpretations are. In this paper we propose to formalise feature selection/attribution based on the concept of relaxed functional dependence. In particular, we extend our notions to the instance-wise setting and derive necessary properties for candidate selection solutions, while leaving room for task-dependence. By computing ground-truth attributions on synthetic datasets, we evaluate many state-of-the-art attribution methods and show that, even when optimised, some fail to verify the proposed properties and provide wrong solutions.

Comments:	38th International Conference on Machine Learning (ICML 2021)
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2104.12437 [cs.LG]
	(or arXiv:2104.12437v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2104.12437
Journal reference:	PMLR 139:76-86, 2021

Submission history

From: Darius Afchar [view email]
[v1] Mon, 26 Apr 2021 10:04:44 UTC (869 KB)
[v2] Mon, 5 Jul 2021 14:27:00 UTC (5,426 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-04

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Darius Afchar
Romain Hennequin
Vincent Guigue

export BibTeX citation

Computer Science > Machine Learning

Title:Towards Rigorous Interpretations: a Formalisation of Feature Attribution

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Towards Rigorous Interpretations: a Formalisation of Feature Attribution

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators