RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation

AI-generated keywords: Retrieval-Augmented Thoughts (RAT)

AI-generated Key Points

  • Introduction of Retrieval-Augmented Thoughts (RAT) method to enhance reasoning and generation abilities of large language models in long-horizon tasks
  • Leveraging iterative refinement of retrieval queries based on evolving reasoning thoughts for more accurate and efficient context generation
  • Case analysis focusing on embodied planning in Minecraft and open-ended creative writing tasks
  • RAT addressing inaccuracies in procedural steps by continuously refining thoughts with targeted retrieval, improving planning effectiveness
  • Outperformance of RAT over other retrieval strategies in creative writing tasks like summarizing historical events by aligning closely with task progression and retrieving accurate information
  • Implementation of rigorous pre-processing methodology to ensure validity of results and mitigate benchmark contamination
  • Comprehensive evaluation demonstrating consistent outperformance of RAT over other methods in various tasks across different domains
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Zihao Wang, Anji Liu, Haowei Lin, Jiaqi Li, Xiaojian Ma, Yitao Liang

License: CC BY 4.0

Abstract: We explore how iterative revising a chain of thoughts with the help of information retrieval significantly improves large language models' reasoning and generation ability in long-horizon generation tasks, while hugely mitigating hallucination. In particular, the proposed method -- *retrieval-augmented thoughts* (RAT) -- revises each thought step one by one with retrieved information relevant to the task query, the current and the past thought steps, after the initial zero-shot CoT is generated. Applying RAT to GPT-3.5, GPT-4, and CodeLLaMA-7b substantially improves their performances on various long-horizon generation tasks; on average of relatively increasing rating scores by 13.63% on code generation, 16.96% on mathematical reasoning, 19.2% on creative writing, and 42.78% on embodied task planning. The demo page can be found at https://craftjarvis.github.io/RAT

Submitted to arXiv on 08 Mar. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2403.05313v1

, , , , A novel method called Retrieval-Augmented Thoughts (RAT) has been introduced by researchers to enhance the reasoning and generation abilities of large language models in long-horizon tasks. This approach leverages iterative refinement of retrieval queries based on evolving reasoning thoughts, leading to more accurate and efficient context generation. The case analysis focused on two specific tasks: embodied planning in Minecraft and open-ended creative writing. In the Minecraft task, traditional methods like ChatGPT showed inaccuracies in procedural steps due to fragmented knowledge sources. However, RAT addressed this issue by continuously refining thoughts with targeted retrieval, improving planning effectiveness by ensuring a comprehensive understanding of all items involved in a plan. For creative writing tasks like summarizing historical events, RAT outperformed other retrieval strategies by aligning closely with task progression and retrieving accurate information. To ensure the validity of their results, the researchers implemented a rigorous pre-processing methodology to mitigate any potential benchmark contamination. The comprehensive evaluation across multiple benchmarks consistently demonstrated that RAT outperformed other methods in various tasks. These findings highlight the effectiveness of RAT in eliciting context-aware reasoning and improving performance in long-horizon generation tasks across different domains.
Created on 30 May. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.