PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Entity Disambiguation using Link based Relations extracted from Wikipedia.
Anja Pilz
In: AKBC 2010, Grenoble, France (2010).


We present an approach for disambiguating textual mentions of ambiguous names. Here, disambiguation means identifying the true entity denoted by a name phrase appearing in a query context by assigning it to the corresponding Wikipedia article. If this article does not exist, we assign the query to a default entity. Name ambiguity is a major problem in information retrieval and causes uncertainty in the assignment of name phrases to existing knowledge base entries. We propose a kernel classifier to address this problem and compare two Wikipedia structures for constructing a rich feature space. The first approach relies on Wikipedia categories, the second on relations constructed from Wikipedia's hyperlink structure. We evaluate both approaches on the German version of Wikipedia and show that both outperform a baseline approach using simple cosine similarity.
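
The cosine-similarity baseline mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the baseline represents text or when it falls back to the default entity, so everything below is an assumption: whitespace-tokenized bag-of-words vectors, a hypothetical `NIL` label for the default entity, a hypothetical `threshold` parameter, and toy English candidate texts standing in for Wikipedia articles.

```python
import math
from collections import Counter

NIL = "NIL"  # hypothetical label for the default entity (no matching article)


def bag_of_words(text):
    """Lowercased whitespace tokens with their counts (assumed representation)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def disambiguate(context, candidates, threshold=0.1):
    """Return the title of the most similar candidate article.

    Falls back to NIL when no candidate scores above `threshold`
    (the threshold is an illustrative assumption, not from the paper).
    """
    query = bag_of_words(context)
    best_title, best_score = NIL, threshold
    for title, article_text in candidates.items():
        score = cosine(query, bag_of_words(article_text))
        if score > best_score:
            best_title, best_score = title, score
    return best_title


# Toy candidates for the ambiguous name "Jaguar" (invented texts).
candidates = {
    "Jaguar (animal)": "large cat species native to the americas",
    "Jaguar (car)": "british manufacturer of luxury cars",
}
print(disambiguate("the jaguar is a large cat", candidates))
print(disambiguate("stock prices fell today", candidates))
```

In this sketch, a query context that shares no terms with any candidate is assigned to the default entity, mirroring the fallback described in the abstract. The kernel classifier proposed in the paper replaces this raw similarity ranking with a learned decision over richer features.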

EPrint Type: Conference or Workshop Item (Paper)
Project Keyword: Project Keyword UNSPECIFIED
Subjects: Natural Language Processing; Information Retrieval & Textual Information Access
ID Code: 8154
Deposited By: Anja Pilz
Deposited On: 03 June 2011