Trusting Your Evidence: Hallucinate Less with Context-aware Decoding

AI-generated keywords: Context-aware decoding Language models Factuality Hyperparameter Summarization

AI-generated Key Points

  • Language models struggle with paying enough attention to input context
  • Context-aware decoding (CAD) is proposed as a solution
  • CAD amplifies the difference between output probabilities with and without context
  • CAD significantly improves the faithfulness of language models for summarization tasks
  • LLaMA shows a 14.3% gain in factuality metrics with CAD
  • CAD overrides a model's prior knowledge when it contradicts the provided context
  • CAD leads to substantial improvements in resolving knowledge conflicts
  • Hyperparameter α controls the adjustment level, with α = 0.5 generally yielding good results
  • CAD outperforms standard decoding algorithms on CNN-DM and XSUM datasets
  • CAD improves both the quality and factuality of generated summaries from diverse language models
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Weijia Shi, Xiaochuang Han, Mike Lewis, Yulia Tsvetkov, Luke Zettlemoyer, Scott Wen-tau Yih

License: CC BY 4.0

Abstract: Language models (LMs) often struggle to pay enough attention to the input context, and generate texts that are unfaithful or contain hallucinations. To mitigate this issue, we present context-aware decoding (CAD), which follows a contrastive output distribution that amplifies the difference between the output probabilities when a model is used with and without context. Our experiments show that CAD, without additional training, significantly improves the faithfulness of different LM families, including OPT, GPT, LLaMA and FLAN-T5 for summarization tasks (e.g., 14.3% gain for LLaMA in factuality metrics). Furthermore, CAD is particularly effective in overriding a model's prior knowledge when it contradicts the provided context, leading to substantial improvements in tasks where resolving the knowledge conflict is essential.

Submitted to arXiv on 24 May. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2305.14739v1

The article discusses the issue of language models struggling to pay enough attention to input context, resulting in unfaithful or hallucinatory texts. To address this problem, the authors propose a solution called context-aware decoding (CAD), which amplifies the difference between output probabilities when a model is used with and without context. The experiments show that CAD significantly improves the faithfulness of different language models for summarization tasks, with a 14.3% gain in factuality metrics for LLaMA. CAD is particularly effective in overriding a model's prior knowledge when it contradicts the provided context, leading to substantial improvements in tasks where resolving knowledge conflicts is essential. The study introduces a hyperparameter α to control the adjustment level, with α = 0.5 generally yielding good results across all settings and datasets. The results on CNN-DM and XSUM datasets demonstrate that CAD outperforms standard decoding algorithms by a large margin, improving both the quality and factuality of generated summaries from diverse language models.
Created on 01 Jul. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.