Multiple Positional Self-Attention Network for Text Classification

Biyun Dai; Jinlong Li; Ruoyi Xu

Back to AAAI

AAAI 2020

Multiple Positional Self-Attention Network for Text Classification

Conference Paper AAAI Technical Track: Natural Language Processing Artificial Intelligence

PDF Details

Abstract

Self-attention mechanisms have recently caused many concerns on Natural Language Processing (NLP) tasks. Relative positional information is important to self-attention mechanisms. We propose Faraway Mask focusing on the (2m+1)gram words and Scaled-Distance Mask putting the logarithmic distance punishment to avoid and weaken the selfattention of distant words respectively. To exploit different masks, we present Positional Self-Attention Layer for generating different Masked-Self-Attentions and a following Position-Fusion Layer in which fused positional information multiplies the Masked-Self-Attentions for generating sentence embeddings. To evaluate our sentence embeddings approach Multiple Positional Self-Attention Network (MPSAN), we perform the comparison experiments on sentiment analysis, semantic relatedness and sentence classiﬁcation tasks. The result shows that our MPSAN outperforms state-of-the-art methods on ﬁve datasets and the test accuracy is improved by 0. 81%, 0. 6% on SST, CR datasets, respectively. In addition, we reduce training parameters and improve the time efﬁciency of MPSAN by lowering the dimension number of self-attention and simplifying fusion mechanism.

Multiple Positional Self-Attention Network for Text Classification

Abstract

Authors

Keywords

Context