Using Conditional Random Fields to Exploit Token Structure and Labels for Accurate Semantic Annotation

Aman Goel; Craig Knoblock; Kristina Lerman

Back to AAAI

AAAI 2011

Using Conditional Random Fields to Exploit Token Structure and Labels for Accurate Semantic Annotation

Conference Paper Papers Artificial Intelligence

PDF Details

Abstract

Automatic semantic annotation of structured data enables unsupervised integration of data from heterogeneous sources but is difﬁcult to perform accurately due to the presence of many numeric ﬁelds and proper-noun ﬁelds that do not allow reference-based approaches and the absence of natural language text that prevents the use of language-based approaches. In addition, several of these semantic types have multiple heterogeneous representations, while sharing syntactic structure with other types. In this work, we propose a new approach to use conditional random ﬁelds (CRFs) to perform semantic annotation of structured data that takes advantage of the structure and labels of the tokens for higher accuracy of ﬁeld labeling, while still allowing the use of exact inference techniques. We compare our approach with a linear-CRF based model that only labels ﬁelds and also with a regular-expression based approach.

Using Conditional Random Fields to Exploit Token Structure and Labels for Accurate Semantic Annotation

Abstract

Authors

Keywords

Context