Efficient Adversarial Training with Robust Early-Bird Tickets

Xi, Zhiheng; Zheng, Rui; Gui, Tao; Zhang, Qi; Huang, Xuanjing

Computer Science > Computation and Language

arXiv:2211.07263 (cs)

[Submitted on 14 Nov 2022 (v1), last revised 30 Nov 2022 (this version, v3)]

Title:Efficient Adversarial Training with Robust Early-Bird Tickets

Authors:Zhiheng Xi, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang

View PDF

Abstract:Adversarial training is one of the most powerful methods to improve the robustness of pre-trained language models (PLMs). However, this approach is typically more expensive than traditional fine-tuning because of the necessity to generate adversarial examples via gradient descent. Delving into the optimization process of adversarial training, we find that robust connectivity patterns emerge in the early training phase (typically $0.15\sim0.3$ epochs), far before parameters converge. Inspired by this finding, we dig out robust early-bird tickets (i.e., subnetworks) to develop an efficient adversarial training method: (1) searching for robust tickets with structured sparsity in the early stage; (2) fine-tuning robust tickets in the remaining time. To extract the robust tickets as early as possible, we design a ticket convergence metric to automatically terminate the searching process. Experiments show that the proposed efficient adversarial training method can achieve up to $7\times \sim 13 \times$ training speedups while maintaining comparable or even better robustness compared to the most competitive state-of-the-art adversarial training methods.

Comments:	EMNLP 2022
Subjects:	Computation and Language (cs.CL)
MSC classes:	68-06
ACM classes:	I.2
Cite as:	arXiv:2211.07263 [cs.CL]
	(or arXiv:2211.07263v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2211.07263

Submission history

From: Rui Zheng [view email]
[v1] Mon, 14 Nov 2022 10:44:25 UTC (8,694 KB)
[v2] Tue, 15 Nov 2022 01:34:39 UTC (8,694 KB)
[v3] Wed, 30 Nov 2022 04:30:55 UTC (8,697 KB)

Computer Science > Computation and Language

Title:Efficient Adversarial Training with Robust Early-Bird Tickets

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Efficient Adversarial Training with Robust Early-Bird Tickets

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators