K-BERT: Enabling Language Representation with Knowledge Graph

Weijie Liu; Peng Zhou; Zhe Zhao; Zhiruo Wang; Qi Ju; Haotang Deng; Ping Wang

AAAI 2020

Conference Paper AAAI Technical Track: Knowledge Representation and Reasoning Artificial Intelligence

Abstract

Pre-trained language representation models, such as BERT, capture a general language representation from large-scale corpora but lack domain-specific knowledge. When reading a domain text, experts make inferences with relevant knowledge. For machines to achieve this capability, we propose a knowledge-enabled language representation model (K-BERT) with knowledge graphs (KGs), in which triples are injected into the sentences as domain knowledge. However, incorporating too much knowledge may divert a sentence from its correct meaning, a problem known as knowledge noise (KN). To overcome KN, K-BERT introduces soft-position and a visible matrix to limit the impact of knowledge. Because K-BERT can load model parameters from pre-trained BERT, it can inject domain knowledge simply by being equipped with a KG, without any pre-training of its own. Our investigation reveals promising results on twelve NLP tasks. In domain-specific tasks in particular (including finance, law, and medicine), K-BERT significantly outperforms BERT, which demonstrates that K-BERT is an excellent choice for solving knowledge-driven problems that require experts.
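The abstract names soft-position and the visible matrix without giving their construction, so the following is only a minimal sketch of one way the idea could work, not the authors' implementation. It assumes single-token entities, a toy KG stored as a dictionary, and a visibility rule under which an injected triple is visible only to itself and to the entity it is attached to. The names `ToyKG` and `inject_triples` are hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical toy KG: entity token -> list of (relation, object) pairs.
ToyKG = Dict[str, List[Tuple[str, str]]]


def inject_triples(
    tokens: List[str], kg: ToyKG
) -> Tuple[List[str], List[int], List[List[bool]]]:
    """Insert KG triples after matching entity tokens (illustrative only).

    Returns the flattened tokens, one soft-position index per token, and a
    visible matrix where True means the two tokens may attend to each other.
    """
    flat: List[str] = []
    soft_pos: List[int] = []
    branch: List[int] = []   # -1 for original sentence tokens
    anchor: List[int] = []   # flat index of the entity an injected token hangs on

    next_branch = 0
    for pos, tok in enumerate(tokens):
        entity_idx = len(flat)
        flat.append(tok)
        soft_pos.append(pos)
        branch.append(-1)
        anchor.append(entity_idx)
        for rel, obj in kg.get(tok, []):
            # Injected tokens continue counting from the entity's position,
            # so the original sentence keeps its own word order indices.
            for offset, extra in enumerate((rel, obj), start=1):
                flat.append(extra)
                soft_pos.append(pos + offset)
                branch.append(next_branch)
                anchor.append(entity_idx)
            next_branch += 1

    n = len(flat)
    visible = [[False] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            both_sentence = branch[i] == -1 and branch[j] == -1
            same_branch = branch[i] != -1 and branch[i] == branch[j]
            entity_link = (branch[i] != -1 and anchor[i] == j) or (
                branch[j] != -1 and anchor[j] == i
            )
            visible[i][j] = both_sentence or same_branch or entity_link
    return flat, soft_pos, visible


# Usage with an invented sentence and invented triples.
flat, soft_pos, visible = inject_triples(
    ["Cook", "visits", "Beijing"],
    {"Cook": [("CEO", "Apple")], "Beijing": [("capital", "China")]},
)
```

In this sketch the injected token "Apple" is visible to "Cook" but not to "visits", which is how a mask of this kind could keep injected knowledge from altering the rest of the sentence. The mask would then be applied inside self-attention, and the soft-position indices would replace the usual sequential position indices.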
