Fine-tuning Language Models for Factuality

AI-generated keywords: Fact-checking Language Models Factuality NLP Fine-tuning

AI-generated Key Points

  • Manual fact-checking is time-consuming and expensive
  • Authors propose a method to fine-tune language models for improved factuality without human labeling
  • Two innovations in NLP are leveraged: measuring factuality using external knowledge base or model's confidence scores, and direct preference optimization algorithm
  • Factuality preference rankings are generated using retrieval systems or novel retrieval-free approach
  • Learning from these rankings enhances factuality of Llama-2 on various topics compared to other strategies
  • Significant reduction in factual error rates for biographies and medical questions compared to Llama-2-chat (58% reduction for biographies, 40% reduction for medical questions)
  • Future research directions include combining factuality tuning with existing methods, scaling up approach to larger models, more benchmarks on long-form language model generations' factuality, integrating reinforcement learning with factuality rankings
  • Work presents practical and effective strategy for improving language models' ability to generate factual content in long-form settings.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Katherine Tian, Eric Mitchell, Huaxiu Yao, Christopher D. Manning, Chelsea Finn

License: CC BY 4.0

Abstract: The fluency and creativity of large pre-trained language models (LLMs) have led to their widespread use, sometimes even as a replacement for traditional search engines. Yet language models are prone to making convincing but factually inaccurate claims, often referred to as 'hallucinations.' These errors can inadvertently spread misinformation or harmfully perpetuate misconceptions. Further, manual fact-checking of model responses is a time-consuming process, making human factuality labels expensive to acquire. In this work, we fine-tune language models to be more factual, without human labeling and targeting more open-ended generation settings than past work. We leverage two key recent innovations in NLP to do so. First, several recent works have proposed methods for judging the factuality of open-ended text by measuring consistency with an external knowledge base or simply a large model's confidence scores. Second, the direct preference optimization algorithm enables straightforward fine-tuning of language models on objectives other than supervised imitation, using a preference ranking over possible model responses. We show that learning from automatically generated factuality preference rankings, generated either through existing retrieval systems or our novel retrieval-free approach, significantly improves the factuality (percent of generated claims that are correct) of Llama-2 on held-out topics compared with RLHF or decoding strategies targeted at factuality. At 7B scale, compared to Llama-2-chat, we observe 58% and 40% reduction in factual error rate when generating biographies and answering medical questions, respectively.

Submitted to arXiv on 14 Nov. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2311.08401v1

Manual fact-checking is time-consuming and expensive, making it challenging to acquire human factuality labels. To tackle this problem, the authors propose a method to fine-tune language models for improved factuality without relying on human labeling. The authors leverage two recent innovations in natural language processing (NLP) to achieve their goal. First, they utilize methods that measure the factuality of open-ended text by assessing consistency with an external knowledge base or a large model's confidence scores. Second, they employ the direct preference optimization algorithm, which allows for straightforward fine-tuning of language models based on preference rankings rather than supervised imitation. To evaluate their approach, the authors generate factuality preference rankings using existing retrieval systems or their novel retrieval-free approach. They demonstrate that learning from these automatically generated rankings significantly enhances the factuality of Llama-2 on various topics compared to other strategies targeted at improving factuality. The results show a substantial reduction in factual error rates when generating biographies and answering medical questions using Llama-2 at a 7B scale. Specifically, there is a 58% reduction in factual error rate for biographies and a 40% reduction for medical questions compared to Llama-2-chat. The authors conclude by highlighting future research directions such as exploring additional ways to combine factuality tuning with existing methods and scaling up their approach to larger models. They also emphasize the need for more benchmarks on long-form language model generations' factuality and suggest investigating how best to integrate typical rewards and approaches from reinforcement learning with factuality rankings. Overall, this work presents a practical and effective strategy for improving language models' ability to generate factual content in long-form settings.
Created on 22 Nov. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.