Spanning spaces: learning cross-lingual similarities
In analyzing multilingual text corpora, we face the practical problem of computing similarities between documents in different languages. Given two documents in different languages, we use each document's monolingual similarity to an aligned set to compute a similarity across languages. We derive several algorithms and show their relationship to the choice of similarity function. We also show experimental results illustrating the approach.
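
As a rough illustration of the idea of comparing documents through an aligned set, the following minimal sketch represents each document by its vector of monolingual similarities to the aligned documents in its own language, then compares the two vectors. This is only one possible instantiation under stated assumptions, not necessarily any of the algorithms derived in the paper: the bag-of-words representation, the use of cosine similarity at both stages, the function names, and the toy aligned pairs are all hypothetical choices made for the example.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Whitespace-tokenized term counts (an assumed, deliberately simple representation)."""
    return Counter(text.lower().split())


def sparse_cosine(u, v):
    """Cosine similarity between two sparse count vectors stored as dicts."""
    dot = sum(w * v.get(term, 0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def similarity_profile(doc, aligned_side):
    """Monolingual similarities of `doc` to each aligned document in the same language."""
    doc_vec = bag_of_words(doc)
    return [sparse_cosine(doc_vec, bag_of_words(ref)) for ref in aligned_side]


def dense_cosine(p, q):
    """Cosine similarity between two equal-length lists of numbers."""
    dot = sum(a * b for a, b in zip(p, q))
    norm_p = math.sqrt(sum(a * a for a in p))
    norm_q = math.sqrt(sum(b * b for b in q))
    return dot / (norm_p * norm_q) if norm_p and norm_q else 0.0


def cross_lingual_similarity(doc_a, doc_b, aligned_pairs):
    """Compare two documents in different languages via their similarity profiles.

    `aligned_pairs` is a list of (language A text, language B text) tuples.
    """
    side_a = [a for a, _ in aligned_pairs]
    side_b = [b for _, b in aligned_pairs]
    return dense_cosine(similarity_profile(doc_a, side_a),
                        similarity_profile(doc_b, side_b))


# Toy aligned set (hypothetical data, English / French).
aligned_pairs = [
    ("the cat sleeps", "le chat dort"),
    ("stock markets fall", "les bourses chutent"),
]

sim_cat = cross_lingual_similarity("the cat eats", "le chat mange", aligned_pairs)
sim_fin = cross_lingual_similarity("the cat eats", "les bourses montent", aligned_pairs)
print(sim_cat, sim_fin)
```

In this toy setup the English document about a cat scores higher against the French document about a cat than against the one about stock markets, even though the two languages share no vocabulary. Swapping `sparse_cosine` or `dense_cosine` for another similarity function changes the resulting measure, which is the kind of dependence on the choice of similarity function that the abstract refers to.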