Self-Discover: Large Language Models Self-Compose Reasoning Structures

AI-generated keywords: SELF-DISCOVER

AI-generated Key Points

  • The study introduces the framework of SELF-DISCOVER for LLMs to autonomously uncover task-specific reasoning structures
  • LLMs select atomic reasoning modules and combine them into an explicit structure to guide their decoding process
  • Significant improvements in performance on challenging benchmarks such as BigBench-Hard, grounded agent reasoning, and MATH compared to existing methods like Chain of Thought (CoT)
  • Outperforms inference-intensive approaches while requiring less compute
  • Self-discovered structures have demonstrated universality across different model families and exhibit similarities with human reasoning patterns
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Pei Zhou, Jay Pujara, Xiang Ren, Xinyun Chen, Heng-Tze Cheng, Quoc V. Le, Ed H. Chi, Denny Zhou, Swaroop Mishra, Huaixiu Steven Zheng

17 pages, 11 figures, 5 tables
License: CC BY 4.0

Abstract: We introduce SELF-DISCOVER, a general framework for LLMs to self-discover the task-intrinsic reasoning structures to tackle complex reasoning problems that are challenging for typical prompting methods. Core to the framework is a self-discovery process where LLMs select multiple atomic reasoning modules such as critical thinking and step-by-step thinking, and compose them into an explicit reasoning structure for LLMs to follow during decoding. SELF-DISCOVER substantially improves GPT-4 and PaLM 2's performance on challenging reasoning benchmarks such as BigBench-Hard, grounded agent reasoning, and MATH, by as much as 32% compared to Chain of Thought (CoT). Furthermore, SELF-DISCOVER outperforms inference-intensive methods such as CoT-Self-Consistency by more than 20%, while requiring 10-40x fewer inference compute. Finally, we show that the self-discovered reasoning structures are universally applicable across model families: from PaLM 2-L to GPT-4, and from GPT-4 to Llama2, and share commonalities with human reasoning patterns.

Submitted to arXiv on 06 Feb. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2402.03620v1

The study titled "Self-Discover: Large Language Models Self-Compose Reasoning Structures" introduces the framework of SELF-DISCOVER for LLMs to autonomously uncover task-specific reasoning structures. This aims to address complex reasoning problems that are challenging for conventional prompting methods. The core concept involves a self-discovery process where LLMs select atomic reasoning modules and combine them into an explicit structure to guide their decoding process. In the first stage, relevant modules are selected, adapted, and implemented into a structured plan for solving the task. This approach has shown significant improvements in performance on challenging benchmarks such as BigBench-Hard, grounded agent reasoning, and MATH compared to existing methods like Chain of Thought (CoT). It also outperforms inference-intensive approaches while requiring less compute. The self-discovered structures have demonstrated universality across different model families and exhibit similarities with human reasoning patterns. Overall, SELF-DISCOVER presents a promising approach for enhancing LLMs' ability to tackle intricate reasoning tasks through autonomous discovery of task-specific structures.
Created on 18 Feb. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.