From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step
AI-generated Key Points
⚠The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.
- Authors Yuntian Deng, Yejin Choi, and Stuart Shieber explore the use of language models for reasoning tasks
- Emphasis on generating explicit chain-of-thought (CoT) steps for high accuracy in final outputs
- Proposal of a novel method to train models to internalize CoT steps by gradually removing intermediate steps through fine-tuning
- Approach allows for simplified reasoning processes while maintaining high performance levels
- GPT-2 Small model achieves impressive accuracy rates of up to 99% on 9-by-9 multiplication problems using this method
- Effectiveness demonstrated on larger language models like Mistral 7B achieving over 50% accuracy on GSM8K without producing any intermediate steps
- Study highlights potential for improving AI systems' reasoning processes through internalization of chain-of-thought steps
Authors: Yuntian Deng, Yejin Choi, Stuart Shieber
Abstract: When leveraging language models for reasoning tasks, generating explicit chain-of-thought (CoT) steps often proves essential for achieving high accuracy in final outputs. In this paper, we investigate if models can be taught to internalize these CoT steps. To this end, we propose a simple yet effective method for internalizing CoT steps: starting with a model trained for explicit CoT reasoning, we gradually remove the intermediate steps and finetune the model. This process allows the model to internalize the intermediate reasoning steps, thus simplifying the reasoning process while maintaining high performance. Our approach enables a GPT-2 Small model to solve 9-by-9 multiplication with up to 99% accuracy, whereas standard training cannot solve beyond 4-by-4 multiplication. Furthermore, our method proves effective on larger language models, such as Mistral 7B, achieving over 50% accuracy on GSM8K without producing any intermediate steps.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
⚠The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.