A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions

AI-generated keywords: Large language models, Hallucinations, Natural language processing, Detection, Mitigation

AI-generated Key Points

The license of the paper does not allow us to build upon its content, so the key points are generated from the paper's metadata rather than the full article.

  • Large language models (LLMs) have revolutionized natural language processing (NLP), enabling significant advancements in text understanding and generation.
  • A critical issue plaguing LLMs is their tendency to produce hallucinations, generating content that deviates from real-world facts or user inputs.
  • There has been a growing focus on detecting and mitigating these hallucinations to address the challenges they pose in practical deployment of LLMs.
  • The survey titled "A Survey on Hallucination in Large Language Models" provides an overview of recent advances in addressing LLM hallucinations, including a taxonomy of hallucinations, factors contributing to their occurrence, methods for detection, benchmarks, and approaches for mitigation.
  • The survey aims to pave the way for future research by analyzing current limitations and formulating open questions related to addressing hallucinations in LLMs.
  • This comprehensive survey spans 49 pages and serves as a valuable resource for researchers and practitioners seeking insights into enhancing the reliability and accuracy of LLM-generated content in NLP applications.

Authors: Lei Huang, Weijiang Yu, Weitao Ma, Weihong Zhong, Zhangyin Feng, Haotian Wang, Qianglong Chen, Weihua Peng, Xiaocheng Feng, Bing Qin, Ting Liu

Work in progress; 49 pages

Abstract: The emergence of large language models (LLMs) has marked a significant breakthrough in natural language processing (NLP), leading to remarkable advancements in text understanding and generation. Nevertheless, alongside these strides, LLMs exhibit a critical tendency to produce hallucinations, resulting in content that is inconsistent with real-world facts or user inputs. This phenomenon poses substantial challenges to their practical deployment and raises concerns over the reliability of LLMs in real-world scenarios, which attracts increasing attention to detect and mitigate these hallucinations. In this survey, we aim to provide a thorough and in-depth overview of recent advances in the field of LLM hallucinations. We begin with an innovative taxonomy of LLM hallucinations, then delve into the factors contributing to hallucinations. Subsequently, we present a comprehensive overview of hallucination detection methods and benchmarks. Additionally, representative approaches designed to mitigate hallucinations are introduced accordingly. Finally, we analyze the challenges that highlight the current limitations and formulate open questions, aiming to delineate pathways for future research on hallucinations in LLMs.

Submitted to arXiv on 09 Nov. 2023

Results of the summarizing process for the arXiv paper: 2311.05232v1

This paper's license does not allow us to build upon its content, so the summary below was generated from the paper's metadata rather than the full article.

The emergence of large language models (LLMs) has revolutionized natural language processing (NLP), enabling significant advancements in text understanding and generation. However, a critical issue plaguing LLMs is their tendency to produce hallucinations, generating content that deviates from real-world facts or user inputs. This phenomenon poses substantial challenges to the practical deployment of LLMs and raises concerns about their reliability in real-world scenarios. In response to this challenge, there has been a growing focus on detecting and mitigating these hallucinations.

In their comprehensive survey titled "A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions," authors Lei Huang, Weijiang Yu, Weitao Ma, Weihong Zhong, Zhangyin Feng, Haotian Wang, Qianglong Chen, Weihua Peng, Xiaocheng Feng, Bing Qin, and Ting Liu aim to provide an in-depth overview of recent advances in addressing LLM hallucinations. The survey begins by introducing an innovative taxonomy of LLM hallucinations and delves into the factors contributing to their occurrence. Subsequently, the authors present a detailed overview of various methods and benchmarks for detecting hallucinations in LLM-generated content. Moreover, the survey highlights representative approaches designed to mitigate hallucinations effectively. By analyzing the current limitations and formulating open questions in this domain, the authors aim to pave the way for future research on addressing hallucinations in LLMs. This work is still in progress and spans 49 pages.

Overall, this survey serves as a valuable resource for researchers and practitioners seeking insights into tackling the challenges posed by hallucinations in large language models. Through its thorough examination of principles, taxonomy, challenges, and open questions surrounding LLM hallucinations, this survey contributes significantly to advancing our understanding of how to enhance the reliability and accuracy of LLM-generated content in NLP applications.

Created on 13 Mar. 2024

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.