Robustness Verification for Transformers

Shi, Zhouxing; Zhang, Huan; Chang, Kai-Wei; Huang, Minlie; Hsieh, Cho-Jui

Computer Science > Machine Learning

arXiv:2002.06622 (cs)

[Submitted on 16 Feb 2020 (v1), last revised 23 Dec 2020 (this version, v2)]

Title:Robustness Verification for Transformers

Authors:Zhouxing Shi, Huan Zhang, Kai-Wei Chang, Minlie Huang, Cho-Jui Hsieh

View PDF

Abstract:Robustness verification that aims to formally certify the prediction behavior of neural networks has become an important tool for understanding model behavior and obtaining safety guarantees. However, previous methods can usually only handle neural networks with relatively simple architectures. In this paper, we consider the robustness verification problem for Transformers. Transformers have complex self-attention layers that pose many challenges for verification, including cross-nonlinearity and cross-position dependency, which have not been discussed in previous works. We resolve these challenges and develop the first robustness verification algorithm for Transformers. The certified robustness bounds computed by our method are significantly tighter than those by naive Interval Bound Propagation. These bounds also shed light on interpreting Transformers as they consistently reflect the importance of different words in sentiment analysis.

Comments:	ICLR 2020
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2002.06622 [cs.LG]
	(or arXiv:2002.06622v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2002.06622

Submission history

From: Zhouxing Shi [view email]
[v1] Sun, 16 Feb 2020 17:16:31 UTC (587 KB)
[v2] Wed, 23 Dec 2020 12:36:47 UTC (589 KB)

Computer Science > Machine Learning

Title:Robustness Verification for Transformers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Robustness Verification for Transformers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators