Saliency strikes back: How filtering out high frequencies improves white-box explanations

Muzellec, Sabine; Fel, Thomas; Boutin, Victor; Andéol, Léo; VanRullen, Rufin; Serre, Thomas

Computer Science > Artificial Intelligence

arXiv:2307.09591 (cs)

[Submitted on 18 Jul 2023 (v1), last revised 7 Jun 2024 (this version, v4)]

Authors: Sabine Muzellec, Thomas Fel, Victor Boutin, Léo Andéol, Rufin VanRullen, Thomas Serre

Abstract: Attribution methods are a class of explainability (XAI) methods that aim to assess how individual inputs contribute to a model's decision-making process. We have identified a significant limitation in one type of attribution method, known as "white-box" methods. Although highly efficient, these methods rely on a gradient signal that, as we will show, is often contaminated by high-frequency artifacts. To overcome this limitation, we introduce a new approach called "FORGrad". This simple method filters out these high-frequency artifacts using optimal cut-off frequencies tailored to the unique characteristics of each model architecture. Our findings show that FORGrad consistently enhances the performance of existing white-box methods, enabling them to compete effectively with more accurate yet computationally demanding "black-box" methods. We anticipate that our research will foster broader adoption of simpler and more efficient white-box methods for explainability, offering a better balance between faithfulness and computational efficiency.
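
The filtering idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' FORGrad implementation: the function name `low_pass_filter`, the ideal circular mask, and the cut-off value are all assumptions made for illustration, and the per-architecture selection of the optimal cut-off is not reproduced here. The sketch only shows how removing high spatial frequencies from a noisy saliency-like map can bring it closer to a smooth underlying signal.

```python
import numpy as np


def low_pass_filter(saliency, cutoff):
    """Zero out spatial frequencies whose radius exceeds `cutoff`.

    `cutoff` is expressed in cycles per image. Both the name and the
    ideal (hard) circular mask are illustrative assumptions.
    """
    h, w = saliency.shape
    # Frequency coordinates in cycles per image, in np.fft.fft2 layout.
    fy = np.fft.fftfreq(h) * h
    fx = np.fft.fftfreq(w) * w
    radius = np.sqrt(fy[:, None] ** 2 + fx[None, :] ** 2)
    mask = radius <= cutoff
    spectrum = np.fft.fft2(saliency)
    # Keep only the low-frequency part and return to pixel space.
    return np.real(np.fft.ifft2(spectrum * mask))


# Synthetic stand-in for a gradient map: a smooth blob plus white noise.
rng = np.random.default_rng(0)
yy, xx = np.mgrid[0:64, 0:64]
smooth = np.exp(-((yy - 32) ** 2 + (xx - 32) ** 2) / (2 * 8.0 ** 2))
noisy = smooth + 0.5 * rng.standard_normal((64, 64))

# Hypothetical cut-off; in practice it would have to be chosen per model.
filtered = low_pass_filter(noisy, cutoff=8)

mse_before = np.mean((noisy - smooth) ** 2)
mse_after = np.mean((filtered - smooth) ** 2)
print(f"MSE before filtering: {mse_before:.4f}")
print(f"MSE after filtering:  {mse_after:.4f}")
```

A hard circular mask is the simplest choice for a sketch; a smoother filter (for example a Gaussian in the frequency domain) would reduce ringing, at the cost of a less sharply defined cut-off.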

Subjects:	Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2307.09591 [cs.AI]
	(or arXiv:2307.09591v4 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2307.09591

Submission history

From: Sabine Muzellec
[v1] Tue, 18 Jul 2023 19:56:20 UTC (11,241 KB)
[v2] Fri, 29 Mar 2024 13:04:03 UTC (14,729 KB)
[v3] Tue, 2 Apr 2024 08:55:51 UTC (14,729 KB)
[v4] Fri, 7 Jun 2024 18:39:49 UTC (15,914 KB)
