PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Statistical Post-Editing of a Rule-Based Machine Translation System
A.-L. Lagarda, V. Alabau, Francisco Casacuberta, R. Silva and E. Díaz-de-Liaño
In: North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009 conference, Boulder, Colorado(2009).


Automatic post-editing (APE) systems aim at correcting the output of machine translation systems to produce better quality translations, i.e. produce translations can be manually post-edited with an increase in productivity. In this work, we present an APE system that uses statistical models to enhance a commercial rule-based machine translation (RB) system. In addition, a procedure for effortless human evaluation has been established. We have tested the APE system with two corpora of different complexity. For the Parliament corpus, we show that the APE system significantly complements and improves the RB system. Results for the Protocols corpus, although less conclusive, are promising as well. Finally, several possible sources of errors have been identified which will help develop future system enhancements.

EPrint Type:Conference or Workshop Item (Oral)
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Natural Language Processing
ID Code:5822
Deposited By:Alfons Juan
Deposited On:08 March 2010