Towards Evaluating the Robustness of Deep Diagnostic Models by Adversarial Attack

Xu, Mengting; Zhang, Tao; Li, Zhongnian; Liu, Mingxia; Zhang, Daoqiang

doi:10.1016/j.media.2021.101977

Computer Science > Computer Vision and Pattern Recognition

arXiv:2103.03438 (cs)

[Submitted on 5 Mar 2021]

Title:Towards Evaluating the Robustness of Deep Diagnostic Models by Adversarial Attack

Authors:Mengting Xu, Tao Zhang, Zhongnian Li, Mingxia Liu, Daoqiang Zhang

View PDF

Abstract:Deep learning models (with neural networks) have been widely used in challenging tasks such as computer-aided disease diagnosis based on medical images. Recent studies have shown deep diagnostic models may not be robust in the inference process and may pose severe security concerns in clinical practice. Among all the factors that make the model not robust, the most serious one is adversarial examples. The so-called "adversarial example" is a well-designed perturbation that is not easily perceived by humans but results in a false output of deep diagnostic models with high confidence. In this paper, we evaluate the robustness of deep diagnostic models by adversarial attack. Specifically, we have performed two types of adversarial attacks to three deep diagnostic models in both single-label and multi-label classification tasks, and found that these models are not reliable when attacked by adversarial example. We have further explored how adversarial examples attack the models, by analyzing their quantitative classification results, intermediate features, discriminability of features and correlation of estimated labels for both original/clean images and those adversarial ones. We have also designed two new defense methods to handle adversarial examples in deep diagnostic models, i.e., Multi-Perturbations Adversarial Training (MPAdvT) and Misclassification-Aware Adversarial Training (MAAdvT). The experimental results have shown that the use of defense methods can significantly improve the robustness of deep diagnostic models against adversarial attacks.

Comments:	This version was accepted in the journal Medical Image Analysis (MedIA)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2103.03438 [cs.CV]
	(or arXiv:2103.03438v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2103.03438
Journal reference:	Medical Image Analysis 69 (2021): 101977
Related DOI:	https://doi.org/10.1016/j.media.2021.101977

Submission history

From: Tao Zhang [view email]
[v1] Fri, 5 Mar 2021 02:24:47 UTC (7,602 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Evaluating the Robustness of Deep Diagnostic Models by Adversarial Attack

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Evaluating the Robustness of Deep Diagnostic Models by Adversarial Attack

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators