RAFT: Adapting Language Model to Domain Specific RAG

AI-generated keywords: Natural Language Processing

AI-generated Key Points

  • Pretraining Large Language Models (LLMs) on extensive textual data is a standard practice in natural language processing.
  • Incorporating new knowledge into pretrained LLMs is crucial for various downstream applications.
  • Methods for integrating new knowledge include RAG-based prompting and fine-tuning.
  • Retrieval Augmented FineTuning (RAFT) offers a novel training recipe to enhance reasoning capabilities within specific domains.

Authors: Tianjun Zhang, Shishir G. Patil, Naman Jain, Sheng Shen, Matei Zaharia, Ion Stoica, Joseph E. Gonzalez

License: CC BY 4.0

Abstract: Pretraining Large Language Models (LLMs) on large corpora of textual data is now a standard paradigm. When using these LLMs for many downstream applications, it is common to additionally bake in new knowledge (e.g., time-critical news, or private domain knowledge) into the pretrained model either through RAG-based prompting or fine-tuning. However, the optimal methodology for the model to gain such new knowledge remains an open question. In this paper, we present Retrieval Augmented FineTuning (RAFT), a training recipe that improves the model's ability to answer questions in "open-book" in-domain settings. In RAFT, given a question and a set of retrieved documents, we train the model to ignore those documents that don't help in answering the question, which we call distractor documents. RAFT accomplishes this by citing verbatim the right sequence from the relevant document that would help answer the question. This, coupled with RAFT's chain-of-thought-style response, helps improve the model's ability to reason. In domain-specific RAG, RAFT consistently improves the model's performance across PubMed, HotpotQA, and Gorilla datasets, presenting a post-training recipe to improve pre-trained LLMs for in-domain RAG. RAFT's code and demo are open-sourced at github.com/ShishirPatil/gorilla.
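The training setup described in the abstract can be illustrated with a small sketch of how one RAFT-style training example might be assembled. This is not the authors' implementation (that lives in the repository linked above); it is a minimal illustration under stated assumptions. The names `RaftExample` and `build_raft_example`, the prompt layout, the `Quote:` / `Reasoning:` / `Final answer:` target format, and the `p_oracle` parameter (the chance that the relevant document is included in the context at all) are all hypothetical choices made for this example.

```python
import random
from dataclasses import dataclass


@dataclass
class RaftExample:
    prompt: str       # question plus shuffled context documents
    target: str       # verbatim quote, reasoning, and final answer
    has_oracle: bool  # whether the relevant document is in the context


def build_raft_example(question, oracle_doc, distractor_docs, quote,
                       reasoning, answer, num_distractors=3,
                       p_oracle=0.8, rng=None):
    """Assemble one training example (hypothetical format, see lead-in)."""
    rng = rng or random.Random()

    # The target cites the relevant document verbatim, so the quote
    # must really occur in that document.
    if quote not in oracle_doc:
        raise ValueError("quote must appear verbatim in the oracle document")

    # Assumption: with probability p_oracle the relevant document is kept;
    # otherwise the context holds distractors only.
    has_oracle = rng.random() < p_oracle

    # Keep the context size fixed by drawing one extra distractor
    # when the relevant document is left out.
    k = num_distractors if has_oracle else num_distractors + 1
    context = rng.sample(distractor_docs, k)
    if has_oracle:
        context.append(oracle_doc)

    # Shuffle so position gives no hint about which document is relevant.
    rng.shuffle(context)

    docs = "\n".join(f"[Document {i}] {doc}"
                     for i, doc in enumerate(context, start=1))
    prompt = f"{docs}\nQuestion: {question}\nAnswer:"

    # Chain-of-thought-style target: quote first, then reasoning, then answer.
    target = (f'Quote: "{quote}"\n'
              f"Reasoning: {reasoning}\n"
              f"Final answer: {answer}")
    return RaftExample(prompt=prompt, target=target, has_oracle=has_oracle)
```

Fine-tuning on pairs of `prompt` and `target` built this way is one plausible way to teach a model to quote the useful passage and ignore the distractor documents; the specific proportions and formats used in the paper may differ.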

Submitted to arXiv on 15 Mar. 2024


Results of the summarizing process for the arXiv paper: 2403.10131v1

In the realm of natural language processing, pretraining Large Language Models (LLMs) on extensive textual data has become a standard practice. These LLMs are often used in downstream applications where incorporating new knowledge into the pretrained model is crucial. This additional knowledge can range from time-sensitive news updates to domain-specific information. The methods commonly employed for integrating it include RAG-based prompting and fine-tuning. However, the most effective approach for giving these models fresh knowledge remains an open question.

Retrieval Augmented FineTuning (RAFT) offers a novel training recipe aimed at this problem. It trains the model to disregard retrieved documents that do not help answer the question and to quote the relevant passage verbatim from the document that does, which enhances the model's reasoning capabilities within a specific domain.
Created on 18 Mar. 2024

