Generate rather than Retrieve: Large Language Models are Strong Context Generators

AI-generated keywords: Generate-then-Read (GenRead)

AI-generated Key Points

  • The paper proposes a new approach to solving knowledge-intensive tasks using large language model generators instead of document retrievers.
  • The proposed method is called GenRead and involves prompting a large language model to generate contextual documents based on a given question and then reading the generated documents to produce the final answer.
  • The authors also propose a clustering-based prompting method that selects distinct prompts, so that the generated documents cover different perspectives and achieve better recall over acceptable answers.
  • GenRead is demonstrated to be effective through extensive experiments on three different knowledge-intensive tasks, including open-domain QA, fact checking, and dialogue systems.
  • GenRead reaches 71.6 and 54.4 exact match on TriviaQA and WebQ, outperforming the state-of-the-art retrieve-then-read pipeline DPR-FiD by +4.0 and +3.9 without retrieving any documents from external knowledge sources.
  • However, GenRead is harder to update and to adapt to new domains than retrieve-then-read methods, which can swap in new documents when new information is learned or add documents from a new domain for quick adaptation to downstream tasks.
  • Future research directions include incorporating new knowledge efficiently into generate-then-read methods while minimizing hallucination errors in generated documents.
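
The two-step flow described in the key points can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: `gen_read`, `toy_generate`, `toy_read`, and the prompt templates are hypothetical names and wording, and the toy functions stand in for a real large language model and a real reader such as FiD.

```python
from typing import Callable, List, Sequence

# Hypothetical prompt templates; the wording is illustrative only.
PROMPTS = (
    "Generate a background document to answer the question.\n"
    "Question: {question}\nDocument:",
    "Write a short encyclopedia-style passage relevant to the question.\n"
    "Question: {question}\nPassage:",
)


def gen_read(
    question: str,
    generate: Callable[[str], str],
    read: Callable[[str, List[str]], str],
    prompts: Sequence[str] = PROMPTS,
) -> str:
    """Step 1: generate one document per prompt. Step 2: read them to answer."""
    documents = [generate(t.replace("{question}", question)) for t in prompts]
    return read(question, documents)


# Toy stand-ins so the sketch runs without any model (assumptions, not real APIs).
TOY_KNOWLEDGE = {"France": "Paris is the capital of France."}


def toy_generate(prompt: str) -> str:
    """Pretend LLM: returns a canned document if a known keyword is in the prompt."""
    for keyword, document in TOY_KNOWLEDGE.items():
        if keyword in prompt:
            return document
    return ""


def toy_read(question: str, documents: List[str]) -> str:
    """Pretend reader: returns the first word of the first non-empty document."""
    for document in documents:
        if document:
            return document.split()[0]
    return "unknown"


answer = gen_read("What is the capital of France?", toy_generate, toy_read)
print(answer)
```

Passing the generator and reader as callables keeps the pipeline shape visible: no retriever or external corpus appears anywhere in `gen_read`.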

Authors: Wenhao Yu, Dan Iter, Shuohang Wang, Yichong Xu, Mingxuan Ju, Soumya Sanyal, Chenguang Zhu, Michael Zeng, Meng Jiang

Accepted at ICLR 2023 (v3, add code and implementation details)
License: CC BY 4.0

Abstract: Knowledge-intensive tasks, such as open-domain question answering (QA), require access to a large amount of world or domain knowledge. A common approach for knowledge-intensive tasks is to employ a retrieve-then-read pipeline that first retrieves a handful of relevant contextual documents from an external corpus such as Wikipedia and then predicts an answer conditioned on the retrieved documents. In this paper, we present a novel perspective for solving knowledge-intensive tasks by replacing document retrievers with large language model generators. We call our method generate-then-read (GenRead), which first prompts a large language model to generate contextual documents based on a given question, and then reads the generated documents to produce the final answer. Furthermore, we propose a novel clustering-based prompting method that selects distinct prompts, resulting in the generated documents that cover different perspectives, leading to better recall over acceptable answers. We conduct extensive experiments on three different knowledge-intensive tasks, including open-domain QA, fact checking, and dialogue system. Notably, GenRead achieves 71.6 and 54.4 exact match scores on TriviaQA and WebQ, significantly outperforming the state-of-the-art retrieve-then-read pipeline DPR-FiD by +4.0 and +3.9, without retrieving any documents from any external knowledge source. Lastly, we demonstrate the model performance can be further improved by combining retrieval and generation. Our code and generated documents can be found at https://github.com/wyu97/GenRead.
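
The exact match scores quoted in the abstract count a prediction as correct only if it equals one of the acceptable answers after normalization. The sketch below assumes the SQuAD-style normalization that is common in open-domain QA (lowercasing, dropping punctuation and English articles, collapsing whitespace); the paper's own evaluation script may differ, and the function names here are illustrative.

```python
import re
import string
from typing import List

_PUNCTUATION = set(string.punctuation)


def normalize(text: str) -> str:
    """Lowercase, strip punctuation and articles, and collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in _PUNCTUATION)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, acceptable_answers: List[str]) -> bool:
    """True if the prediction matches any acceptable answer after normalization."""
    target = normalize(prediction)
    return any(target == normalize(answer) for answer in acceptable_answers)


def em_score(predictions: List[str], references: List[List[str]]) -> float:
    """Percentage of predictions that exactly match an acceptable answer."""
    hits = sum(exact_match(p, refs) for p, refs in zip(predictions, references))
    return 100.0 * hits / len(predictions)


score = em_score(
    ["The Beatles", "london"],
    [["Beatles"], ["Paris"]],
)
print(score)
```

In this toy call the first prediction matches after the article is stripped and the second does not, so half of the predictions count as exact matches.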

Submitted to arXiv on 21 Sep. 2022


Results of the summarizing process for the arXiv paper: 2209.10063v3

This paper presents a novel approach to solving knowledge-intensive tasks, such as open-domain question answering (QA), by replacing document retrievers with large language model generators. The proposed method, called generate-then-read (GenRead), prompts a large language model to generate contextual documents based on a given question and then reads the generated documents to produce the final answer. The authors also propose a clustering-based prompting method that selects distinct prompts, so that the generated documents cover different perspectives and achieve better recall over acceptable answers. The effectiveness of GenRead is demonstrated through extensive experiments on three different knowledge-intensive tasks: open-domain QA, fact checking, and dialogue systems. Notably, GenRead achieves 71.6 and 54.4 exact match scores on the TriviaQA and WebQ datasets, respectively, outperforming the state-of-the-art retrieve-then-read pipeline DPR-FiD by +4.0 and +3.9 without retrieving any documents from external knowledge sources. Furthermore, the authors conducted a readability study comparing retrieved documents and generated documents from the NQ, TriviaQA, and WebQ datasets in cases where both contain the correct answer. The results show that the FiD reader produces more correct answers when reading documents generated by large language models such as InstructGPT. Despite its strong performance on the presented datasets, GenRead is harder to update and to adapt to new domains than retrieve-then-read methods, which can swap in new documents when new information is learned or add documents from a new domain for quick adaptation to downstream tasks.
In conclusion, this paper offers an innovative perspective on knowledge-intensive tasks: replacing dense retrieval models with large language model generators through the GenRead method. It leaves as future work how to incorporate new knowledge efficiently into generate-then-read methods while minimizing hallucination errors in generated documents.
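
One way to picture the clustering-based prompting mentioned in the summary is to group example (question, document) pairs by embedding and build one few-shot prompt per group, so each prompt steers generation toward a different perspective. The sketch below is an assumption-laden illustration: it uses a small hand-written k-means over precomputed toy embeddings, and `build_cluster_prompts` and `make_prompt` are hypothetical helpers, not functions from the authors' code.

```python
import random
from typing import List, Sequence, Tuple

Pair = Tuple[str, str]  # (question, document)


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors: List[Sequence[float]], k: int,
           iterations: int = 20, seed: int = 0) -> List[int]:
    """Plain k-means; returns one cluster label per input vector."""
    rng = random.Random(seed)
    centroids = [list(v) for v in rng.sample(vectors, k)]
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assign each vector to its nearest centroid.
        labels = [min(range(k), key=lambda c: squared_distance(v, centroids[c]))
                  for v in vectors]
        # Move each centroid to the mean of its members (skip empty clusters).
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def build_cluster_prompts(pairs: List[Pair], embeddings: List[Sequence[float]],
                          k: int, per_cluster: int = 2, seed: int = 0) -> List[str]:
    """Build one block of in-context demonstrations per cluster."""
    labels = kmeans(embeddings, k, seed=seed)
    rng = random.Random(seed)
    demos = []
    for c in range(k):
        members = [p for p, lab in zip(pairs, labels) if lab == c]
        chosen = rng.sample(members, min(per_cluster, len(members)))
        demos.append("\n\n".join(f"Question: {q}\nDocument: {d}" for q, d in chosen))
    return demos


def make_prompt(demonstrations: str, question: str) -> str:
    """Append the new question to a block of demonstrations."""
    return f"{demonstrations}\n\nQuestion: {question}\nDocument:"


# Toy data: two clearly separated groups of (question, document) pairs.
pairs = [("q1", "d1"), ("q2", "d2"), ("q3", "d3"), ("q4", "d4")]
embeddings = [[0.0, 0.0], [0.0, 1.0], [10.0, 10.0], [10.0, 11.0]]
labels = kmeans(embeddings, k=2)
prompts = [make_prompt(d, "new question") for d in build_cluster_prompts(pairs, embeddings, k=2)]
```

Each resulting prompt draws its demonstrations from a single cluster, so sending all of them to the generator yields one document per cluster rather than several near-duplicates.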
Created on 17 May. 2023

