Unsupervised Embedding Enhancements of Knowledge Graphs using Textual Associations

Unsupervised Embedding Enhancements of Knowledge Graphs using Textual Associations

Neil Veira, Brian Keng, Kanchana Padmanabhan, Andreas Veneris

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
Main track. Pages 5218-5225. https://doi.org/10.24963/ijcai.2019/725

Knowledge graph embeddings are instrumental for representing and learning from multi-relational data, with recent embedding models showing high effectiveness for inferring new facts from existing databases. However, such precisely structured data is usually limited in quantity and in scope. Therefore, to fully optimize the embeddings it is important to also consider more widely available sources of information such as text. This paper describes an unsupervised approach to incorporate textual information by augmenting entity embeddings with embeddings of associated words. The approach does not modify the optimization objective for the knowledge graph embedding, which allows it to be integrated with existing embedding models. Two distinct forms of textual data are considered, with different embedding enhancements proposed for each case. In the first case, each entity has an associated text document that describes it. In the second case, a text document is not available, and instead entities occur as words or phrases in an unstructured corpus of text fragments. Experiments show that both methods can offer improvement on the link prediction task when applied to many different knowledge graph embedding models.
Keywords:
Natural Language Processing: Knowledge Extraction
Natural Language Processing: Embeddings