Enhancing Adversarial Defense by k-Winners-Take-All

Xiao, Chang; Zhong, Peilin; Zheng, Changxi

arXiv:1905.10510 (cs)

[Submitted on 25 May 2019 (v1), last revised 29 Oct 2019 (this version, v3)]

Abstract: We propose a simple change to existing neural network structures for better defending against gradient-based adversarial attacks. Instead of using popular activation functions (such as ReLU), we advocate the use of k-Winners-Take-All (k-WTA) activation, a C^0 discontinuous function that purposely invalidates the neural network model's gradient at densely distributed input data points. The proposed k-WTA activation can be readily used in nearly all existing networks and training methods with no significant overhead. Our proposal is theoretically rationalized. We analyze why the discontinuities in k-WTA networks can largely prevent gradient-based search of adversarial examples and why they at the same time remain innocuous to the network training. This understanding is also empirically backed. We test k-WTA activation on various network structures optimized by a training method, be it adversarial training or not. In all cases, the robustness of k-WTA networks outperforms that of traditional networks under white-box attacks.
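The abstract names the k-WTA activation without spelling it out. The sketch below assumes the usual reading of the name: within a layer's activation vector, the k largest entries pass through unchanged and every other entry is set to zero. The function names `kwta` and `relu`, the tie-breaking rule, and the sample vectors are illustrative assumptions, not the authors' code. The example also shows the discontinuity the abstract refers to: a small change in the input can swap which entries win, so the output jumps, whereas ReLU changes only slightly.

```python
def kwta(values, k):
    """Keep the k largest entries of `values`; set every other entry to 0.0."""
    if not 0 <= k <= len(values):
        raise ValueError("k must be between 0 and len(values)")
    # Indices ordered by activation, largest first. Ties go to the lower
    # index, which is a choice made for this sketch only.
    order = sorted(range(len(values)), key=lambda i: (-values[i], i))
    winners = set(order[:k])
    return [v if i in winners else 0.0 for i, v in enumerate(values)]


def relu(values):
    """Elementwise ReLU, included only for comparison."""
    return [max(v, 0.0) for v in values]


x = [0.50, 0.49, -1.0, 2.0]
x_shifted = [0.50, 0.51, -1.0, 2.0]  # second entry nudged by 0.02

print(kwta(x, 2))          # [0.5, 0.0, 0.0, 2.0]
print(kwta(x_shifted, 2))  # [0.0, 0.51, 0.0, 2.0]
print(relu(x))             # [0.5, 0.49, 0.0, 2.0]
print(relu(x_shifted))     # [0.5, 0.51, 0.0, 2.0]
```

With k = 2, nudging one entry by 0.02 moves two output entries by about 0.5 each, because the set of winners changes. ReLU on the same pair of inputs differs only in the nudged entry.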

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML)
Cite as: arXiv:1905.10510 [cs.LG] (or arXiv:1905.10510v3 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.1905.10510

Submission history

From: Chang Xiao
[v1] Sat, 25 May 2019 03:36:40 UTC (6,313 KB)
[v2] Mon, 10 Jun 2019 20:14:15 UTC (6,313 KB)
[v3] Tue, 29 Oct 2019 00:27:18 UTC (8,425 KB)

