Adversarial attack detection based on example semantics and model activation features
J Chen, T Yu, C Wu, H Zheng, W Zhao… - 2022 5th International …, 2022 - ieeexplore.ieee.org
J Chen, T Yu, C Wu, H Zheng, W Zhao, L Pang, H Li
2022 5th International Conference on Data Science and Information …, 2022•ieeexplore.ieee.orgWith the widespread application of deep learning technology, its security issue is also
gradually paid attention to. To improve the security and reliability of deep learning
technology in practical applications, we focus on the vulnerability of deep neural networks
against adversarial attacks and address the problems of existing adversarial example
detection algorithms that rely on pre-known attack types, and low detection efficiency, and
high detection costs. In this paper, we propose an adversarial attack detection method …
gradually paid attention to. To improve the security and reliability of deep learning
technology in practical applications, we focus on the vulnerability of deep neural networks
against adversarial attacks and address the problems of existing adversarial example
detection algorithms that rely on pre-known attack types, and low detection efficiency, and
high detection costs. In this paper, we propose an adversarial attack detection method …
With the widespread application of deep learning technology, its security issue is also gradually paid attention to. To improve the security and reliability of deep learning technology in practical applications, we focus on the vulnerability of deep neural networks against adversarial attacks and address the problems of existing adversarial example detection algorithms that rely on pre-known attack types, and low detection efficiency, and high detection costs. In this paper, we propose an adversarial attack detection method based on sample semantics and model activation features to provide an effective solution sample semantics for the problems of attack dependence and uninterpretable detection results of detection methods. Firstly, normal examples are input into the deep model to obtain the semantic features and model activation features. Secondly, binary classification datasets are constructed separately to train the binary classifier and then divides by the binary classification detector for we construct the adversarial samples in two parts based on relevance features and model activation features to achieve the detection of adversarial samples. Finally, in the experiments, the detection rate against different attacks under different datasets reached higher than 93.00%. At the same time, the detection algorithm in this paper still maintains high performance in the case of the attacker's known detection algorithm, the average detection rate decreases by no more than 5%.
ieeexplore.ieee.org
Showing the best result for this search. See all results