PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

A Geometric view on bilingual lexicon extraction from comparable corpora
Eric Gaussier, Jean-Michel Renders, Irina Matveeva, Cyril Goutte and Herve Dejean
In: 42nd Annual Meeting of the Association for Computational Linguistics, July 25-26, 2004, Barcelona, Spain.

Abstract

We adopt in this study a geometric view on bilingual lexicon extraction from comparable corpora. This view makes it possible to re-interpret the methods proposed so far and identify unresolved problems. We then motivate and formulate three new methods, partly inspired by latent semantic analysis, that aim at solving these problems. We finally evaluate these methods. showing their strengths and weaknesses. Our final results show a significant gain in the accuracyof extracted lexicons.

PDF - PASCAL Members only - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Conference or Workshop Item (Paper)
Additional Information:http://www.xrce.xerox.com/Publications/Display-Abstract.php?ReportID=1212
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Learning/Statistics & Optimisation
Natural Language Processing
Information Retrieval & Textual Information Access
ID Code:553
Deposited By:Cyril Goutte
Deposited On:25 December 2004