The Mythos of Model Interpretability

Lipton, Zachary C.

Computer Science > Machine Learning

arXiv:1606.03490 (cs)

[Submitted on 10 Jun 2016 (v1), last revised 6 Mar 2017 (this version, v3)]

Title:The Mythos of Model Interpretability

Authors:Zachary C. Lipton

View PDF

Abstract:Supervised machine learning models boast remarkable predictive capabilities. But can you trust your model? Will it work in deployment? What else can it tell you about the world? We want models to be not only good, but interpretable. And yet the task of interpretation appears underspecified. Papers provide diverse and sometimes non-overlapping motivations for interpretability, and offer myriad notions of what attributes render models interpretable. Despite this ambiguity, many papers proclaim interpretability axiomatically, absent further explanation. In this paper, we seek to refine the discourse on interpretability. First, we examine the motivations underlying interest in interpretability, finding them to be diverse and occasionally discordant. Then, we address model properties and techniques thought to confer interpretability, identifying transparency to humans and post-hoc explanations as competing notions. Throughout, we discuss the feasibility and desirability of different notions, and question the oft-made assertions that linear models are interpretable and that deep neural networks are not.

Comments:	presented at 2016 ICML Workshop on Human Interpretability in Machine Learning (WHI 2016), New York, NY
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Cite as:	arXiv:1606.03490 [cs.LG]
	(or arXiv:1606.03490v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1606.03490

Submission history

From: Zachary Lipton [view email]
[v1] Fri, 10 Jun 2016 21:28:47 UTC (55 KB)
[v2] Thu, 16 Jun 2016 21:21:04 UTC (55 KB)
[v3] Mon, 6 Mar 2017 08:51:10 UTC (368 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2016-06

Change to browse by:

cs
cs.AI
cs.CV
cs.NE
stat
stat.ML

References & Citations

7 blog links

(what is this?)

DBLP - CS Bibliography

listing | bibtex

Zachary Chase Lipton

export BibTeX citation

Computer Science > Machine Learning

Title:The Mythos of Model Interpretability

Submission history

Access Paper:

References & Citations

7 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:The Mythos of Model Interpretability

Submission history

Access Paper:

References & Citations

7 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators