Word Lattices for Multi-Source Translation
Josh Schroeder, Trevor Cohn and Phillip Koehn
In: 12th Conference of the European Chapter of the ACL (EACL), 30 Mar - 3 Apr 2009, Athens, Greece.
Multi-source statistical machine translation is the process of generating a single translation from multiple inputs. Previous work has focused primarily on selecting from potential outputs of separate translation systems, and solely on multi-parallel corpora and test sets. We demonstrate how multi-source translation can be adapted for multiple monolingual inputs. We also examine different approaches to dealing with multiple sources, including consensus decoding, and we present a novel method of input combination to generate lattices for multi-source translation within a single translation model.