PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Entity Disambiguation using Link based Relations extracted from Wikipedia
Anja Pilz
In: AKBC 2010, Grenoble, France (2010).

Abstract

We present an approach for disambiguating textual mentions of ambiguous names. Here, disambiguation means identifying the true entity denoted by a name phrase that appears in a query context, by assigning the phrase to the corresponding Wikipedia article. If no such article exists, we assign the query to a default entity. Name ambiguity is a major problem in information retrieval and causes uncertainty when name phrases are assigned to existing knowledge base entries. We propose a kernel classifier for this problem and compare two Wikipedia structures for constructing a rich feature space. The first approach relies on Wikipedia categories, the second on relations constructed from Wikipedia's hyperlink structure. We evaluate both approaches on the German version of Wikipedia and show that both outperform a baseline that uses simple cosine similarity.
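
As an illustration only, the cosine similarity baseline mentioned in the abstract might look roughly like the sketch below. This is not the paper's implementation, and it does not show the kernel classifier. The bag-of-words representation, the function names, the `DEFAULT_ENTITY` label, and the similarity threshold used to fall back to the default entity are all assumptions made for this example.

```python
import math
import re
from collections import Counter

# Hypothetical label for "no matching Wikipedia article exists".
DEFAULT_ENTITY = "<default>"


def bag_of_words(text):
    """Return a lowercased term-frequency vector for a text."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def disambiguate(context, candidates, threshold=0.1):
    """Assign a query context to the most similar candidate article.

    `candidates` maps article titles to article text. If no candidate
    scores above `threshold` (an assumed cut-off), the query is assigned
    to the default entity.
    """
    query = bag_of_words(context)
    best_title, best_score = DEFAULT_ENTITY, threshold
    for title, article_text in candidates.items():
        score = cosine(query, bag_of_words(article_text))
        if score > best_score:
            best_title, best_score = title, score
    return best_title


candidates = {
    "Jaguar (animal)": "the jaguar is a large cat living in the rainforest",
    "Jaguar (car)": "jaguar is a british manufacturer of luxury cars",
}
print(disambiguate("the jaguar hunts in the rainforest", candidates))
print(disambiguate("stock prices fell sharply", candidates))
```

The first query shares most of its terms with the animal article and is assigned to it. The second shares no terms with either candidate, so it falls back to the default entity.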

EPrint Type: Conference or Workshop Item (Paper)
Project Keyword: UNSPECIFIED
Subjects: Natural Language Processing; Information Retrieval & Textual Information Access
ID Code: 6779
Deposited By: Anja Pilz
Deposited On: 05 June 2011