Chain-of-Thought Reasoning Without Prompting

AI-generated keywords: Chain-of-Thought Reasoning

AI-generated Key Points

  • Study explores effectiveness of large language models (LLMs) in reasoning without specific prompting techniques
  • Investigates whether LLMs can effectively reason without prompts by altering decoding process
  • CoT reasoning paths can be elicited from pre-trained LLMs by exploring alternative tokens in top-k sequences
  • Presence of CoT in decoding path correlates with higher confidence in model's decoded answer
  • Proposed CoT-decoding method significantly outperforms standard greedy decoding on various reasoning benchmarks
  • Study removes need for CoT prompting and focuses on token-level search during decoding while utilizing confidence scores
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Xuezhi Wang, Denny Zhou

License: CC BY 4.0

Abstract: In enhancing the reasoning capabilities of large language models (LLMs), prior research primarily focuses on specific prompting techniques such as few-shot or zero-shot chain-of-thought (CoT) prompting. These methods, while effective, often involve manually intensive prompt engineering. Our study takes a novel approach by asking: Can LLMs reason effectively without prompting? Our findings reveal that, intriguingly, CoT reasoning paths can be elicited from pre-trained LLMs by simply altering the \textit{decoding} process. Rather than conventional greedy decoding, we investigate the top-$k$ alternative tokens, uncovering that CoT paths are frequently inherent in these sequences. This approach not only bypasses the confounders of prompting but also allows us to assess the LLMs' \textit{intrinsic} reasoning abilities. Moreover, we observe that the presence of a CoT in the decoding path correlates with a higher confidence in the model's decoded answer. This confidence metric effectively differentiates between CoT and non-CoT paths. Extensive empirical studies on various reasoning benchmarks show that the proposed CoT-decoding substantially outperforms the standard greedy decoding.

Submitted to arXiv on 15 Feb. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2402.10200v1

, , , , The study "Chain-of-Thought Reasoning Without Prompting" delves into the effectiveness of large language models (LLMs) in reasoning without the need for specific prompting techniques. Previous research has focused on methods such as few-shot or zero-shot chain-of-thought (CoT) prompting, which require manual prompt engineering. However, this study takes a new approach by investigating whether LLMs can effectively reason without prompts. The findings reveal that CoT reasoning paths can be elicited from pre-trained LLMs by simply altering the decoding process. By exploring alternative tokens in the top-k sequences instead of using conventional greedy decoding, the study uncovers that CoT paths are frequently present in these sequences. This not only eliminates the need for prompting but also allows for an assessment of the LLMs' intrinsic reasoning abilities. Furthermore, it is observed that the presence of a CoT in the decoding path correlates with higher confidence in the model's decoded answer. This confidence metric effectively distinguishes between CoT and non-CoT paths. Extensive empirical studies on various reasoning benchmarks demonstrate that the proposed CoT-decoding method significantly outperforms standard greedy decoding. In contrast to recent works that still rely on CoT prompting to improve generation processes, this study completely removes that need and focuses on searching at the token-level during decoding while utilizing confidence scores. Additionally, other recent works explore how chain-of-thought emerges in language models and highlight how pretraining distribution influences model performance in few-shot reasoning scenarios. Techniques such as instruction-tuning or distillation offer alternative ways to elicit reasoning paths from language models without explicit prompting. Overall, this study provides valuable insights into enhancing LLMs' reasoning capabilities without relying on traditional prompting methods and showcases significant improvements through innovative decoding strategies.
Created on 25 Feb. 2024
Available in other languages: fr

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.