Edge: Enriching Knowledge Graph Embeddings with External Text

AI-generated keywords: Knowledge Graph Embedding Learning EDGE Framework Link Prediction Node Classification

AI-generated Key Points

  • Authors propose a framework called EDGE for knowledge graph enrichment and embedding
  • Goal is to address sparsity in knowledge graphs by integrating a learning component
  • EDGE constructs a graph using external text based on semantic and structural similarities
  • Aligns the constructed graph with the original knowledge graph in the same embedding space
  • Infuses learning through graph alignment to ensure similar entities are close together and dissimilar entities are pushed apart
  • Integration of information from an auxiliary textual source enhances the quality of low-dimensional embeddings
  • Takes input of a knowledge graph (KG) and an external source of texts (T) to generate an augmented knowledge graph (aKG)
  • Multi-criteria objective function is devised to align KG and aKG by minimizing distance between their embeddings
  • Extensive experiments show that EDGE outperforms state-of-the-art models in link prediction and node classification tasks
  • Contributions include proposing EDGE, introducing procedure for generating augmented knowledge graph, and novel embedding approach with multi-criteria objective function optimization
  • Paper is organized into sections discussing gaps in existing literature, problem definition, evaluation through experiments, conclusion, and future directions
  • Related work focuses on heterogeneous knowledge graphs while this work focuses on entity embedding learning
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Saed Rezayi, Handong Zhao, Sungchul Kim, Ryan A. Rossi, Nedim Lipka, Sheng Li

Accepted in NAACL'21
License: CC BY-NC-SA 4.0

Abstract: Knowledge graphs suffer from sparsity which degrades the quality of representations generated by various methods. While there is an abundance of textual information throughout the web and many existing knowledge bases, aligning information across these diverse data sources remains a challenge in the literature. Previous work has partially addressed this issue by enriching knowledge graph entities based on "hard" co-occurrence of words present in the entities of the knowledge graphs and external text, while we achieve "soft" augmentation by proposing a knowledge graph enrichment and embedding framework named Edge. Given an original knowledge graph, we first generate a rich but noisy augmented graph using external texts in semantic and structural level. To distill the relevant knowledge and suppress the introduced noise, we design a graph alignment term in a shared embedding space between the original graph and augmented graph. To enhance the embedding learning on the augmented graph, we further regularize the locality relationship of target entity based on negative sampling. Experimental results on four benchmark datasets demonstrate the robustness and effectiveness of Edge in link prediction and node classification.

Submitted to arXiv on 11 Apr. 2021

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2104.04909v1

In this paper, the authors propose a novel framework called EDGE for knowledge graph enrichment and embedding to address the issue of sparsity in knowledge graphs. The goal is to integrate a learning component into the process to improve the quality of representations generated by various methods. EDGE first constructs a graph using external text based on semantic and structural similarities and aligns it with the original knowledge graph in the same embedding space. By infusing learning in the knowledge distillation process through graph alignment, EDGE ensures that similar entities remain close together while dissimilar entities are pushed further apart. This integration of information from an auxiliary textual source helps enhance the quality of low-dimensional embeddings by introducing new features. The framework takes as input a knowledge graph (KG) and an external source of texts (T) and generates an augmented knowledge graph (aKG). The generation of aKG considers semantic and structural similarities among KG entities and ensures that all original entities are included. This facilitates the alignment process between KG and aKG in the embedding space. To align KG and aKG, a multi-criteria objective function is devised, which minimizes the distance between their embeddings. Textual nodes related to each target entity are rewarded while unrelated ones are penalized using negative sampling. Extensive experiments on four benchmark datasets demonstrate that EDGE outperforms state-of-the-art models in tasks such as link prediction and node classification. The evaluation results also confirm the generalizability of the model. The contributions of this work include: proposing EDGE, introducing a procedure to generate an augmented knowledge graph from external texts that is linked with the original knowledge graph; proposing a novel knowledge graph embedding approach that optimizes a multi-criteria objective function to align two knowledge graphs in a joint embedding space; demonstrating effectiveness and generalizability through evaluations on link prediction and node classification tasks on four datasets. The rest of the paper is organized as follows: Section 2 identifies gaps in existing literature; Section 3 provides problem definition with detailed explanation of proposed model; Section 4 evaluates model through experiments on link prediction and node classification presenting results along with ablation study; finally concluding work discussing future directions in Section 5. In terms of related work, many approaches focus on heterogeneous knowledge graphs with different types of edges whereas this work specifically considers only one type relation focusing solely on entity embedding learning; additionally mentioning its relation to Graph Neural Networks such as Graph Convolutional Networks (GCN).
Created on 31 Jul. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.