PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Literality Based Sample Sorting for Syntax Projection
Bruno Cavestro and Nicola Cancedda
In: Cross-Language Knowledge Induction Workshop, 2-4 Aug 2005, Cluj-Napoca, Romania.


We consider the problem of projecting syntax trees across different sides of a parallel corpus, without using any language dependent feature. To achieve this task we introduce a literality score and use it to sort the bi-sentences of the parallel corpus in different classes. We show how to iteratively train a parser over those classes.

PDF - Requires Adobe Acrobat Reader or other PDF viewer.
EPrint Type:Conference or Workshop Item (Paper)
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Computational, Information-Theoretic Learning with Statistics
Natural Language Processing
ID Code:1078
Deposited By:Bruno Cavestro
Deposited On:10 September 2005