The authors of this study build upon previous research to enhance the performance of Large Language Models (LLMs) through knowledge injection. By utilizing entity triplets and summaries extracted from Wikipedia API, they create a set of 54K training samples for the NBA domain. Unlike previous approaches, they preserve the triplet format to maintain naturalness in generated responses. The model is trained using a special token "TRUE_FACT:" and a causal language model objective due to its decoder-only architecture. Two settings for knowledge injection are experimented with: Intermediate tuning where finetuning is done exclusively on knowledge text before SFT data, and Combined tuning where both types of data are jointly finetuned. The effectiveness of these techniques is evaluated along with knowledge retention during intermediate finetuning stages. Additionally, the authors introduce HaloCheck, a lightweight BlackBox framework for quantifying hallucinations in LLMs without requiring extensive computational resources or question generation modules. Comparisons with selfcheckGPT-NLI show its efficiency in detecting subtle contradictions within sampled responses through quantitative and qualitative analyses. This study contributes valuable insights into reducing hallucinations in low-parameter LLMs and introduces a novel framework for evaluating hallucination severity in generated responses. It also paves the way for future research to expand these approaches across multiple domains and improve model performance in challenging tasks.
- - Authors enhance performance of Large Language Models (LLMs) through knowledge injection
- - Utilize entity triplets and summaries from Wikipedia API to create 54K training samples for NBA domain
- - Preserve triplet format for naturalness in generated responses
- - Model trained using special token "TRUE_FACT:" and causal language model objective
- - Experiment with two settings for knowledge injection: Intermediate tuning and Combined tuning
- - Evaluate effectiveness of techniques and knowledge retention during intermediate finetuning stages
- - Introduce HaloCheck, a lightweight BlackBox framework for quantifying hallucinations in LLMs
- - Compare with selfcheckGPT-NLI to show efficiency in detecting contradictions in responses
- - Contribute insights into reducing hallucinations in low-parameter LLMs and introduce novel evaluation framework
- - Pave way for future research to expand approaches across domains and improve model performance
SummaryAuthors make big language models better by adding knowledge. They use facts and summaries from Wikipedia about NBA to teach the model. The model keeps the facts in a certain format to sound natural. It learns using a special word and specific goals. They try two ways to add knowledge: one in the middle and one combined. They check how well it works and if the model remembers what it learned.
Definitions- Authors: People who write books, articles, or research.
- Large Language Models (LLMs): Big computer programs that understand and generate human language.
- Entity triplets: Sets of three related pieces of information.
- Summaries: Short explanations of longer texts.
- Wikipedia API: A tool that lets you access information from Wikipedia automatically.
- Training samples: Examples used to teach a computer program.
- NBA domain: Information related to basketball teams, players, and games.
- Triplet format: Keeping information in groups of three for easier understanding.
- Causal language model objective: Specific goals for teaching a language model how cause-and-effect relationships work.
- Knowledge injection: Adding new information to improve learning or performance.
- Intermediate tuning: Adjusting settings during training at a middle stage.
- Combined tuning: Making changes by mixing different methods together.
- Effectiveness: How well something works or achieves its goal.
- Knowledge retention: Remembering what was learned over time.
- Finetuning stages: Different steps taken to improve a model's performance further after initial training is done.
Introduction:
Large Language Models (LLMs) have been making significant strides in natural language processing tasks, such as text generation and question-answering. However, these models often suffer from a common issue known as "hallucinations," where they generate responses that are not supported by the given input or context. This can lead to inaccurate and unreliable outputs, hindering their performance in real-world applications.
In order to address this problem, a team of researchers has recently published a paper titled "Knowledge Injection for Reducing Hallucinations in Large Language Models" where they propose a novel approach to enhance LLMs' performance through knowledge injection. In this blog article, we will dive into the details of this research paper and discuss its contributions towards reducing hallucinations in LLMs.
Background:
Previous research has shown that incorporating external knowledge into LLMs can improve their performance on various tasks. However, most approaches use large amounts of data and complex architectures, making them computationally expensive and difficult to implement.
The authors build upon previous studies by utilizing entity triplets and summaries extracted from Wikipedia API to create a set of 54K training samples specifically for the NBA domain. Unlike previous methods that convert triplets into sentences or paragraphs, the authors preserve the triplet format to maintain naturalness in generated responses.
Methodology:
The proposed model is trained using a special token "TRUE_FACT:" along with a causal language model objective due to its decoder-only architecture. The authors experiment with two settings for knowledge injection: Intermediate tuning where finetuning is done exclusively on knowledge text before SFT data, and Combined tuning where both types of data are jointly finetuned.
To evaluate the effectiveness of these techniques, the authors measure knowledge retention during intermediate finetuning stages. They also introduce HaloCheck - a lightweight BlackBox framework for quantifying hallucinations without requiring extensive computational resources or question generation modules.
Results:
The results show that both intermediate and combined tuning significantly reduce hallucinations in LLMs. However, the combined tuning approach outperforms the intermediate tuning method, indicating that jointly finetuning on both knowledge text and SFT data is more effective in reducing hallucinations.
Furthermore, HaloCheck proves to be an efficient tool for detecting subtle contradictions within sampled responses through quantitative and qualitative analyses. The authors compare it with selfcheckGPT-NLI and show its superiority in identifying hallucinations.
Conclusion:
This research paper presents a valuable contribution towards reducing hallucinations in low-parameter LLMs by introducing a novel approach of knowledge injection using entity triplets and summaries from Wikipedia API. The use of a special token "TRUE_FACT:" along with causal language model objective makes this method computationally efficient compared to previous approaches.
Moreover, the introduction of HaloCheck provides a lightweight framework for evaluating hallucination severity in generated responses without requiring extensive computational resources or question generation modules. This not only aids in improving LLM performance but also paves the way for future research to expand these approaches across multiple domains and tasks.
In conclusion, this study highlights the importance of incorporating external knowledge into LLMs to improve their performance while addressing common issues such as hallucinations. It also opens up avenues for further exploration and advancements in this field.