Incorporating Explicit Knowledge in Pre-trained Language Models for Passage Re-ranking

AI-generated keywords: Passage Re-ranking Pre-trained Language Models Knowledge Graph Knowledge Meta Graph GMN Module

AI-generated Key Points

  • Passage re-ranking is a crucial task in information retrieval
  • Pre-trained Language Models (PLMs) have shown potential in improving re-rankers
  • Existing PLM-based re-rankers face challenges such as vocabulary mismatch and lack of domain-specific knowledge
  • The proposed approach incorporates explicit knowledge from a knowledge graph into the re-ranking process
  • A knowledge meta graph is introduced to distill reliable knowledge from the original graph
  • PLM is used as the text encoder and a graph neural network is used as the knowledge encoder
  • A novel knowledge injector facilitates dynamic interaction between the text encoder and knowledge encoder
  • The method includes a GMN module within the knowledge injector to refine explicit knowledge based on learned text context features
  • Experimental results demonstrate the effectiveness of the approach, especially for queries requiring domain-specific knowledge
  • Our approach outperforms baselines that leverage external resources or incorporate domain-specific information
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Qian Dong, Yiding Liu, Suqi Cheng, Shuaiqiang Wang, Zhicong Cheng, Shuzi Niu, Dawei Yin

License: CC BY 4.0

Abstract: Passage re-ranking is to obtain a permutation over the candidate passage set from retrieval stage. Re-rankers have been boomed by Pre-trained Language Models (PLMs) due to their overwhelming advantages in natural language understanding. However, existing PLM based re-rankers may easily suffer from vocabulary mismatch and lack of domain specific knowledge. To alleviate these problems, explicit knowledge contained in knowledge graph is carefully introduced in our work. Specifically, we employ the existing knowledge graph which is incomplete and noisy, and first apply it in passage re-ranking task. To leverage a reliable knowledge, we propose a novel knowledge graph distillation method and obtain a knowledge meta graph as the bridge between query and passage. To align both kinds of embedding in the latent space, we employ PLM as text encoder and graph neural network over knowledge meta graph as knowledge encoder. Besides, a novel knowledge injector is designed for the dynamic interaction between text and knowledge encoder. Experimental results demonstrate the effectiveness of our method especially in queries requiring in-depth domain knowledge.

Submitted to arXiv on 25 Apr. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2204.11673v1

Passage re-ranking is a crucial task in information retrieval and Pre-trained Language Models (PLMs) have shown great potential in improving the performance of re-rankers. However, existing PLM-based re-rankers often face challenges such as vocabulary mismatch and lack of domain-specific knowledge. To address these issues, we propose a novel approach that incorporates explicit knowledge from a knowledge graph into the re-ranking process. In our method, we leverage an existing but incomplete and noisy knowledge graph to enhance passage re-ranking. We introduce a knowledge meta graph that serves as a bridge between the query and passage by distilling reliable knowledge from the original graph. To align the embeddings of both text and knowledge in the latent space, we use PLM as the text encoder and employ a graph neural network over the knowledge meta graph as the knowledge encoder. To facilitate dynamic interaction between the text encoder and knowledge encoder, we design a novel knowledge injector. This injector allows for seamless integration of explicit knowledge into the re-ranking process. Inspired by CokeBERT, our method includes a GMN module within the knowledge injector to refine the context of explicit knowledge based on the learned text context features. Experimental results demonstrate that our approach is effective, particularly for queries that require in-depth domain knowledge. By incorporating explicit knowledge from a carefully curated knowledge graph, our method overcomes vocabulary mismatch and enhances domain-specific understanding in passage re-ranking tasks. In addition to our proposed approach, we also compare it with several baselines to evaluate its performance. These baselines include methods that focus on leveraging external resources or incorporating domain-specific information. Our approach outperforms these baselines, further highlighting its effectiveness in capturing relevant information for passage re-ranking tasks. Overall, our work demonstrates how incorporating explicit domain specific knowledge from a curated graph can significantly improve PLM based passage re ranking systems by addressing vocabulary mismatch and enhancing domain specific understanding; thus providing more accurate and relevant results for queries that require in depth understanding of their subject matter.
Created on 31 Jul. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.