Corrective Retrieval Augmented Generation

AI-generated keywords: Large language models

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Large language models (LLMs) exhibit hallucinations due to limitations in accuracy based on parametric knowledge
  • Retrieval-augmented generation (RAG) proposed as a solution to complement LLMs
  • Concerns about RAG effectiveness if retrieval fails due to relevance of retrieved documents
  • Corrective Retrieval Augmented Generation (CRAG) introduced by researchers including Shi-Qi Yan, Jia-Chen Gu, Yun Zhu, and Zhen-Hua Ling to enhance text generation robustness
  • CRAG incorporates a lightweight retrieval evaluator for assessing quality of retrieved documents and guiding knowledge retrieval actions
  • CRAG leverages large-scale web searches to augment retrieval process and focuses on key information while filtering out irrelevant content
  • Plug-and-play compatibility with various RAG-based approaches for seamless integration into existing models
  • Experimental evaluations show significant performance improvements with CRAG in mitigating hallucinations and enhancing text generation quality
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Shi-Qi Yan, Jia-Chen Gu, Yun Zhu, Zhen-Hua Ling

Abstract: Large language models (LLMs) inevitably exhibit hallucinations since the accuracy of generated texts cannot be secured solely by the parametric knowledge they encapsulate. Although retrieval-augmented generation (RAG) is a practicable complement to LLMs, it relies heavily on the relevance of retrieved documents, raising concerns about how the model behaves if retrieval goes wrong. To this end, we propose the Corrective Retrieval Augmented Generation (CRAG) to improve the robustness of generation. Specifically, a lightweight retrieval evaluator is designed to assess the overall quality of retrieved documents for a query, returning a confidence degree based on which different knowledge retrieval actions can be triggered. Since retrieval from static and limited corpora can only return sub-optimal documents, large-scale web searches are utilized as an extension for augmenting the retrieval results. Besides, a decompose-then-recompose algorithm is designed for retrieved documents to selectively focus on key information and filter out irrelevant information in them. CRAG is plug-and-play and can be seamlessly coupled with various RAG-based approaches. Experiments on four datasets covering short- and long-form generation tasks show that CRAG can significantly improve the performance of RAG-based approaches.

Submitted to arXiv on 29 Jan. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2401.15884v2

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

, , , , Large language models (LLMs) have shown to exhibit hallucinations due to the limitations in securing the accuracy of generated texts solely based on their parametric knowledge. While retrieval-augmented generation (RAG) has been proposed as a viable solution to complement LLMs, its effectiveness heavily relies on the relevance of retrieved documents, leading to concerns about model behavior if retrieval fails. In response to this challenge, a team of researchers including Shi-Qi Yan, Jia-Chen Gu, Yun Zhu, and Zhen-Hua Ling introduces Corrective Retrieval Augmented Generation (CRAG) as a novel approach to enhance the robustness of text generation. CRAG incorporates a lightweight retrieval evaluator designed to assess the quality of retrieved documents for a given query. This evaluator provides a confidence degree that guides different knowledge retrieval actions based on the assessed relevance. Recognizing that traditional retrieval from static and limited corpora may yield sub-optimal results, CRAG leverages large-scale web searches as an extension to augment the retrieval process. Additionally, CRAG implements a decompose-then-recompose algorithm that selectively focuses on key information within retrieved documents while filtering out irrelevant content. One notable feature of CRAG is its plug-and-play compatibility with various RAG-based approaches, allowing seamless integration into existing models. Experimental evaluations conducted across four datasets covering both short- and long-form generation tasks demonstrate significant performance improvements when employing CRAG. The results highlight CRAG's ability to mitigate hallucinations in generated texts and enhance overall text generation quality through its corrective retrieval mechanisms. In conclusion, Corrective Retrieval Augmented Generation presents a promising advancement in addressing the challenges associated with hallucinations in large language models by improving the reliability and robustness of text generation processes through enhanced retrieval strategies and selective content focus.
Created on 25 Apr. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.