Anchors Selection for Cross-lingual Embedding Alignment through Time

Anchors Selection for Cross-lingual Embedding Alignment through Time

Filippo Pallucchini

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence
Doctoral Consortium. Pages 5867-5868. https://doi.org/10.24963/ijcai.2022/836

In recent years, vector representations of words have proven to be extremely useful across a wide range of NLP applications. Because of the broad interest in the topic, it became essential to answer the following question: is it possible to align different embeddings, in order to compare terms belonging to different vector spaces and their relations? While embedding alignment received considerable attention in the literature, how to find the best anchors for this process is still an open problem; in this paper, we propose an unsupervised, automatic method to select words belonging to different corpora that are close from a semantic point of view, and can be used as anchors for aligning their respective embedding spaces.
Keywords:
Speech & Natural Language Processing (SNLP): General