PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Unsupervised domain adaptation based on text relatedness
George Petasis
In: Recent Advances in Natural Language Processing (RANLP 2011), 12-14 Sep 2011, Hissar, Boulgaria.


In this paper an unsupervised approach to do-main adaptation is presented, which exploits external knowledge sources in order to port a classification model into a new thematic do-main. Our approach extracts a new feature set from documents of the target domain, and tries to align the new features to the original ones, by exploiting text relatedness from external knowledge sources, such as WordNet. The approach has been evaluated on the task of document classification, involving the classification of newsgroup postings into 20 news groups.

PDF - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Conference or Workshop Item (Poster)
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Learning/Statistics & Optimisation
Natural Language Processing
ID Code:9387
Deposited By:Georgios Petasis
Deposited On:16 March 2012