Robustness Verification for Transformers

Zhouxing Shi; Huan Zhang; Kai-Wei Chang; Minlie Huang; Cho-Jui Hsieh

ICLR 2020

Conference Paper · Poster Presentations · Artificial Intelligence · Machine Learning

Abstract

Robustness verification, which aims to formally certify the prediction behavior of neural networks, has become an important tool for understanding model behavior and obtaining safety guarantees. However, previous methods can usually handle only neural networks with relatively simple architectures. In this paper, we consider the robustness verification problem for Transformers. Transformers have complex self-attention layers that pose many challenges for verification, including cross-nonlinearity and cross-position dependency, which have not been discussed in previous works. We resolve these challenges and develop the first robustness verification algorithm for Transformers. The certified robustness bounds computed by our method are significantly tighter than those obtained by naive Interval Bound Propagation. These bounds also shed light on interpreting Transformers, as they consistently reflect the importance of different words in sentiment analysis.
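
For readers unfamiliar with the baseline named in the abstract, the following is a minimal sketch of naive Interval Bound Propagation on a single linear layer followed by ReLU. It is not the paper's algorithm and does not handle self-attention; the function names (`linear_bounds`, `relu_bounds`) and the toy weights are assumptions chosen purely for illustration.

```python
from typing import List, Tuple

Vector = List[float]
Matrix = List[List[float]]


def linear_bounds(W: Matrix, b: Vector,
                  lower: Vector, upper: Vector) -> Tuple[Vector, Vector]:
    """Propagate elementwise bounds lower <= x <= upper through y = W x + b."""
    out_lo: Vector = []
    out_hi: Vector = []
    for row, bias in zip(W, b):
        lo = hi = bias
        for w, l, u in zip(row, lower, upper):
            # A non-negative weight attains its minimum at the lower input
            # bound; a negative weight attains it at the upper input bound.
            if w >= 0:
                lo += w * l
                hi += w * u
            else:
                lo += w * u
                hi += w * l
        out_lo.append(lo)
        out_hi.append(hi)
    return out_lo, out_hi


def relu_bounds(lower: Vector, upper: Vector) -> Tuple[Vector, Vector]:
    """ReLU is monotone, so it can be applied to each bound separately."""
    return [max(0.0, l) for l in lower], [max(0.0, u) for u in upper]


if __name__ == "__main__":
    # Hypothetical toy layer and input; eps is the L-infinity perturbation radius.
    W = [[1.0, -1.0], [2.0, 1.0]]
    b = [0.0, 0.5]
    x = [1.0, -1.0]
    eps = 0.1
    lo, hi = linear_bounds(W, b, [v - eps for v in x], [v + eps for v in x])
    lo, hi = relu_bounds(lo, hi)
    print("lower:", lo)
    print("upper:", hi)
```

Because each layer's interval is computed independently of how its inputs are correlated, such bounds tend to loosen as depth grows, which is the looseness the abstract contrasts against.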

Keywords

Robustness
Verification
Transformers

Context

Venue: International Conference on Learning Representations
Paper id: 806635545473801705