Test-time Computing: from System-1 Thinking to System-2 Thinking

AI-generated keywords: Test-time Computing System-1 Thinking System-2 Thinking Complex Reasoning Models Artificial Intelligence

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • The paper by Yixin Ji et al. explores test-time computing scaling in complex reasoning models
  • The o1 model demonstrates impressive performance in handling intricate reasoning tasks
  • Test-time computing can enhance the capabilities of the o1 model, enabling more powerful System-2 thinking
  • There is a gap in comprehensive surveys focusing on test-time computing scaling
  • Test-time computing addresses distribution shifts, enhances robustness, and generalization through techniques like parameter updating, input modification, representation editing, and output calibration
  • Strategies like repeated sampling and tree search algorithms are used to improve reasoning abilities for tackling complex problems
  • The study organizes its survey based on the evolution from weaker System-2 models to stronger ones with the help of test-time computing
  • Several potential future research directions are highlighted by the authors
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yixin Ji, Juntao Li, Hai Ye, Kaixin Wu, Jia Xu, Linjian Mo, Min Zhang

work in progress

Abstract: The remarkable performance of the o1 model in complex reasoning demonstrates that test-time computing scaling can further unlock the model's potential, enabling powerful System-2 thinking. However, there is still a lack of comprehensive surveys for test-time computing scaling. We trace the concept of test-time computing back to System-1 models. In System-1 models, test-time computing addresses distribution shifts and improves robustness and generalization through parameter updating, input modification, representation editing, and output calibration. In System-2 models, it enhances the model's reasoning ability to solve complex problems through repeated sampling, self-correction, and tree search. We organize this survey according to the trend of System-1 to System-2 thinking, highlighting the key role of test-time computing in the transition from System-1 models to weak System-2 models, and then to strong System-2 models. We also point out a few possible future directions.

Submitted to arXiv on 05 Jan. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2501.02497v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The paper "Test-time Computing: from System-1 Thinking to System-2 Thinking" by Yixin Ji, Juntao Li, Hai Ye, Kaixin Wu, Jia Xu, Linjian Mo, and Min Zhang explores the concept of test-time computing scaling in the context of complex reasoning models. The authors highlight the impressive performance of the o1 model in handling intricate reasoning tasks and emphasize how test-time computing can further enhance the model's capabilities, enabling more powerful System-2 thinking. Despite the promising results demonstrated by the o1 model, there remains a gap in comprehensive surveys focusing on test-time computing scaling. is a crucial aspect in improving model performance and advancing towards more sophisticated forms of reasoning within systems. In their study, Ji et al. delve into its origins and trace it back to , where it addresses distribution shifts and enhances robustness and generalization through various techniques such as parameter updating, input modification, representation editing, and output calibration. On the other hand, heavily relies on test-time computing to improve reasoning abilities for tackling complex problems. This is achieved through strategies like repeated sampling,, and tree search algorithms. The paper organizes its survey based on the evolution from to , emphasizing how test-time computing facilitates the transition from weaker <kd>System-2 models</ kd > to stronger ones. Additionally,< kd >the authors point out several potential future directions for research in this area.</ kd > Overall,this study sheds light on the significance of test-time computing scaling in enhancing model performance and advancing towards more sophisticated forms of reasoning within artificial intelligence systems.
Created on 11 Jan. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.