Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

AI-generated keywords: Retrieval-augmented generation Pre-trained language models Knowledge-intensive tasks Parametric memory Non-parametric memory

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Large pre-trained language models have limitations in accessing and manipulating knowledge
  • Performance of these models is not as good as task-specific architectures on knowledge-intensive tasks
  • The authors propose retrieval-augmented generation (RAG) models that combine pre-trained parametric and non-parametric memory for language generation
  • RAG models use a pre-trained seq2seq model as parametric memory and a dense vector index of Wikipedia accessed through a pre-trained neural retriever as non-parametric memory
  • Two formulations of RAG models are compared: one with the same retrieved passages throughout the generated sequence, and another with different passages per token
  • Fine-tuning and evaluation on various knowledge-intensive NLP tasks show state-of-the-art results on three open domain QA tasks, outperforming parametric seq2seq models and task-specific retrieve and extract architectures
  • RAG models generate more specific, diverse, and factual language compared to a state-of-the-art parametric-only seq2seq baseline for language generation tasks
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Patrick Lewis, Ethan Perez, Aleksandara Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela

Abstract: Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, their ability to access and precisely manipulate knowledge is still limited, and hence on knowledge-intensive tasks, their performance lags behind task-specific architectures. Additionally, providing provenance for their decisions and updating their world knowledge remain open research problems. Pre-trained models with a differentiable access mechanism to explicit non-parametric memory can overcome this issue, but have so far been only investigated for extractive downstream tasks. We explore a general-purpose fine-tuning recipe for retrieval-augmented generation (RAG) -- models which combine pre-trained parametric and non-parametric memory for language generation. We introduce RAG models where the parametric memory is a pre-trained seq2seq model and the non-parametric memory is a dense vector index of Wikipedia, accessed with a pre-trained neural retriever. We compare two RAG formulations, one which conditions on the same retrieved passages across the whole generated sequence, the other can use different passages per token. We fine-tune and evaluate our models on a wide range of knowledge-intensive NLP tasks and set the state-of-the-art on three open domain QA tasks, outperforming parametric seq2seq models and task-specific retrieve-and-extract architectures. For language generation tasks, we find that RAG models generate more specific, diverse and factual language than a state-of-the-art parametric-only seq2seq baseline.

Submitted to arXiv on 22 May. 2020

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2005.11401v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The paper discusses the limitations of large pre-trained language models in accessing and manipulating knowledge, particularly on knowledge-intensive tasks. While these models excel in fine-tuning on downstream NLP tasks, their performance falls behind task-specific architectures. To address this issue, the authors propose a solution by introducing retrieval-augmented generation (RAG) models that combine pre-trained parametric and non-parametric memory for language generation. The parametric memory is a pre-trained seq2seq model while the non-parametric memory is a dense vector index of Wikipedia accessed through a pre-trained neural retriever. Two formulations of RAG models are compared: one that conditions on the same retrieved passages throughout the generated sequence and another that allows different passages per token. Fine-tuning and evaluation on various knowledge intensive NLP tasks show state of the art results on three open domain QA tasks outperforming both parametric seq2seq models and task specific retrieve and extract architectures. Additionally, RAG models generate more specific, diverse and factual language compared to a state of the art parametric only seq2seq baseline for language generation tasks. The paper is authored by Patrick Lewis, Ethan Perez, Aleksandara Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler Mike Lewis Wen tau Yih Tim Rocktäschel Sebastian Riedel and Douwe Kiela.
Created on 20 Nov. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.