PASCAL - Pattern Analysis, Statistical Modelling and Computational Learning

Induction of Cross-Language Affix and Letter Sequence Correspondence
Ari Rappoport and Tsahi Levent-Levi
EACL 2006 workshop on cross-language knowledge induction 2006.


We introduce the problem of explicit modeling of form relationships between words in different languages, focusing here on languages having an alphabetic writing system and affixal morphology. We present an algorithm that learns the cross-language correspondence between affixes and letter sequences. The algorithm does not assume prior knowledge of affixes in any of the languages, using only a simple single letter correspondence as seed. Results are given for the English-Spanish language pair.

EPrint Type:Article
Project Keyword:Project Keyword UNSPECIFIED
Subjects:Natural Language Processing
ID Code:4096
Deposited By:Ari Rappoport
Deposited On:25 March 2008