PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Assessing the Challenge of Fine-grained Named Entity Recognition and Classification
Asif Ekbal, Eva Sourjikova, Anette Frank and Simone Ponzetto
In: Named Entities Workshop 2010, Uppsala, Sweden(2010).

Abstract

Named Entity Recognition and Classi- fication (NERC) is a well-studied NLP task typically focused on coarse-grained named entity (NE) classes. NERC for more fine-grained semantic NE classes has not been systematically studied. This pa- per quantifies the difficulty of fine-grained NERC (FG-NERC) when performed at large scale on the people domain. We apply unsupervised acquisition methods to construct a gold standard dataset for FG-NERC. This dataset is used to bench- mark methods for classifying NEs at var- ious levels of fine-grainedness using clas- sical NERC techniques and global contex- tual information inspired from Word Sense Disambiguation approaches. Our results indicate high difficulty of the task and pro- vide a ‘strong’ baseline for future research.

EPrint Type:Conference or Workshop Item (Paper)
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Natural Language Processing
ID Code:7517
Deposited By:Sebastian Pado
Deposited On:17 March 2011