ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models

AI-generated keywords: Augmented Language Models (ALMs)

AI-generated Key Points

  • Augmented Language Models (ALMs) combine reasoning capabilities of Large Language Models (LLMs) with knowledge retrieval and action execution tools.
  • Existing ALM systems incur high computational cost because prompts are resent redundantly and the LLM is invoked repeatedly.
  • ReWOO (Reasoning WithOut Observation) is a modular paradigm proposed in this study that detaches the reasoning process from external observations, significantly reducing token consumption.
  • Comprehensive evaluations across six public NLP benchmarks and a curated dataset reveal consistent performance enhancements with the proposed methodology.
  • ReWOO achieves 5x token efficiency and 4% accuracy improvement on HotpotQA, a multi-step reasoning benchmark.
  • ReWOO demonstrates robustness under tool-failure scenarios.
  • Decoupling parametric modules from non-parametric tool calls enables instruction fine-tuning to offload LLMs into smaller language models, thus substantially reducing model parameters.
  • As an illustration, the authors offload reasoning ability from 175B GPT3.5 into 7B LLaMA, which points to the potential for efficient and scalable ALM systems.
  • The study provides a detailed description of the provided tools, which is appended to the context prompt to enable zero-shot evaluation.
  • Exemplars are manually crafted for the ReWOO Planner, covering information retrieval, comparison, equation solving, and calculation for different benchmarks.
  • The number of reasoning steps k in exemplars is typically 2 or 3.
  • This study proposes a novel approach to tackle challenges associated with existing ALM systems by introducing ReWOO as a modular paradigm that detaches the reasoning process from external observations while achieving consistent performance enhancements across various benchmarks.
  • This approach offers significant potential for truly efficient and scalable ALM systems by enabling instruction fine-tuning to offload LLMs into smaller language models while reducing model parameters.
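
The points above about tool descriptions and exemplars can be pictured with a small prompt-assembly sketch. This is only an illustration under stated assumptions: the function name `build_planner_prompt`, the instruction wording, the tool descriptions, and the `#E1` variable convention are all hypothetical and are not taken from the paper's actual prompts.

```python
from typing import Dict, List


def build_planner_prompt(question: str, tools: Dict[str, str], exemplars: List[str]) -> str:
    """Assemble a planner prompt from tool descriptions and optional exemplars.

    Passing an empty exemplar list gives a zero-shot prompt that relies on
    the tool descriptions alone. All wording here is hypothetical.
    """
    parts = [
        "Make a step-by-step plan to answer the question.",
        "Store each tool result in a variable #E1, #E2, ... so later steps can use it.",
        "Tools:",
    ]
    # One line per tool: name, call syntax, and a short description.
    parts.extend(f"{name}[input]: {description}" for name, description in tools.items())
    if exemplars:
        parts.append("Examples:")
        parts.extend(exemplars)
    parts.append(f"Question: {question}")
    return "\n".join(parts)


# Hypothetical tool descriptions and a single two-step exemplar.
TOOLS = {
    "Lookup": "retrieves a short factual passage for a query",
    "Calculator": "evaluates an arithmetic expression",
}
EXEMPLAR = (
    "Question: What is twice the number of sides of a hexagon?\n"
    "#E1 = Lookup[sides of a hexagon]\n"
    "#E2 = Calculator[#E1 * 2]"
)

zero_shot = build_planner_prompt("How many legs do three spiders have?", TOOLS, [])
few_shot = build_planner_prompt("How many legs do three spiders have?", TOOLS, [EXEMPLAR])
print(few_shot)
```

The exemplar here has two reasoning steps, in line with the typical k of 2 or 3 mentioned above.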

Authors: Binfeng Xu, Zhiyuan Peng, Bowen Lei, Subhabrata Mukherjee, Yuchen Liu, Dongkuan Xu

License: CC BY 4.0

Abstract: Augmented Language Models (ALMs) blend the reasoning capabilities of Large Language Models (LLMs) with tools that allow for knowledge retrieval and action execution. Existing ALM systems trigger LLM thought processes while pulling observations from these tools in an interleaved fashion. Specifically, an LLM reasons to call an external tool, gets halted to fetch the tool's response, and then decides the next action based on all preceding response tokens. Such a paradigm, though straightforward and easy to implement, often leads to huge computation complexity from redundant prompts and repeated execution. This study addresses such challenges for the first time, proposing a modular paradigm ReWOO (Reasoning WithOut Observation) that detaches the reasoning process from external observations, thus significantly reducing token consumption. Comprehensive evaluations across six public NLP benchmarks and a curated dataset reveal consistent performance enhancements with our proposed methodology. Notably, ReWOO achieves 5x token efficiency and 4% accuracy improvement on HotpotQA, a multi-step reasoning benchmark. Furthermore, ReWOO demonstrates robustness under tool-failure scenarios. Beyond prompt efficiency, decoupling parametric modules from non-parametric tool calls enables instruction fine-tuning to offload LLMs into smaller language models, thus substantially reducing model parameters. Our illustrative work offloads reasoning ability from 175B GPT3.5 into 7B LLaMA, demonstrating the significant potential for truly efficient and scalable ALM systems.
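
The decoupling described in the abstract can be sketched roughly as follows: a plan is written once, the tools are then run without further LLM calls, and the collected evidence is handed to a final answering step. This is a minimal sketch, not the authors' implementation. The plan syntax `#E1 = Tool[input]`, the function names, and the stub tools are assumptions made for illustration, and the plan is hard-coded where a real system would obtain it from one planner call to an LLM.

```python
import operator
import re
from typing import Callable, Dict, List, Tuple

# Assumed plan syntax for this sketch: one step per line, "#E<n> = Tool[input]".
STEP_PATTERN = re.compile(r"^(#E\d+)\s*=\s*(\w+)\[(.*)\]$")
EVIDENCE_PATTERN = re.compile(r"#E\d+")

Step = Tuple[str, str, str]  # (evidence variable, tool name, tool input)


def parse_plan(plan_text: str) -> List[Step]:
    """Split the planner's text into steps, skipping lines that do not match."""
    steps: List[Step] = []
    for line in plan_text.strip().splitlines():
        match = STEP_PATTERN.match(line.strip())
        if match:
            steps.append((match.group(1), match.group(2), match.group(3)))
    return steps


def run_tools(steps: List[Step], tools: Dict[str, Callable[[str], str]]) -> Dict[str, str]:
    """Run each tool in order; no language-model call happens in this loop."""
    evidence: Dict[str, str] = {}
    for variable, tool_name, tool_input in steps:
        # Fill in evidence from earlier steps; unknown variables are left as-is.
        resolved = EVIDENCE_PATTERN.sub(
            lambda m: evidence.get(m.group(0), m.group(0)), tool_input
        )
        evidence[variable] = tools[tool_name](resolved)
    return evidence


def build_solver_prompt(question: str, steps: List[Step], evidence: Dict[str, str]) -> str:
    """Combine the question, the plan, and the evidence for one final LLM call."""
    lines = [f"Question: {question}"]
    for variable, tool_name, tool_input in steps:
        lines.append(f"{variable} = {tool_name}[{tool_input}] -> {evidence[variable]}")
    lines.append("Answer:")
    return "\n".join(lines)


# Stub tools standing in for real retrieval and calculation.
FACTS = {"sides of a hexagon": "6"}
OPS = {"+": operator.add, "-": operator.sub, "*": operator.mul}


def lookup(query: str) -> str:
    return FACTS.get(query, "unknown")


def calculator(expression: str) -> str:
    left, op, right = expression.split()
    return str(OPS[op](int(left), int(right)))


# Hard-coded here; in a real system this would be the planner's output.
PLAN = """
#E1 = Lookup[sides of a hexagon]
#E2 = Calculator[#E1 * 2]
"""

steps = parse_plan(PLAN)
evidence = run_tools(steps, {"Lookup": lookup, "Calculator": calculator})
solver_prompt = build_solver_prompt(
    "What is twice the number of sides of a hexagon?", steps, evidence
)
print(solver_prompt)
```

The point of the sketch is the call pattern: the language model would be involved only before `parse_plan` and after `build_solver_prompt`, whereas an interleaved system would call it again after every tool result.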

Submitted to arXiv on 23 May 2023

Results of the summarizing process for the arXiv paper: 2305.18323v1

Augmented Language Models (ALMs) combine the reasoning capabilities of Large Language Models (LLMs) with tools that enable knowledge retrieval and action execution. However, existing ALM systems incur high computational cost because prompts are resent redundantly and the LLM is invoked repeatedly. To address this, the study proposes a modular paradigm called ReWOO (Reasoning WithOut Observation), which detaches the reasoning process from external observations and significantly reduces token consumption.

Comprehensive evaluations across six public NLP benchmarks and a curated dataset reveal consistent performance enhancements with the proposed methodology. Notably, ReWOO achieves 5x token efficiency and a 4% accuracy improvement on HotpotQA, a multi-step reasoning benchmark. ReWOO also demonstrates robustness under tool-failure scenarios.

Decoupling parametric modules from non-parametric tool calls also enables instruction fine-tuning to offload LLMs into smaller language models, substantially reducing model parameters. As an illustration, the authors offload reasoning ability from 175B GPT3.5 into 7B LLaMA, which points to the potential for truly efficient and scalable ALM systems.

The study provides a detailed description of the provided tools, which is appended to the context prompt to enable zero-shot evaluation. Exemplars are manually crafted for the ReWOO Planner, covering information retrieval, comparison, equation solving, and calculation for different benchmarks. The number of reasoning steps k in exemplars is typically 2 or 3.

In summary, the study addresses the cost of existing ALM systems by introducing ReWOO, a modular paradigm that detaches the reasoning process from external observations while achieving consistent performance enhancements across various benchmarks. Because it enables instruction fine-tuning to offload LLMs into smaller language models, the approach offers significant potential for efficient and scalable ALM systems.
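
To see why interleaving reasoning with observations inflates token usage, a rough count of prompt tokens helps. The model below is a deliberate simplification and not the paper's analysis: it assumes a fixed context prompt, a fixed number of tokens added per reasoning step, ignores generated tokens, and uses made-up sizes. The function names are hypothetical.

```python
def interleaved_prompt_tokens(context: int, per_step: int, k: int) -> int:
    """Prompt tokens when the LLM is called once per step.

    Call i (counting from 0) resends the context plus everything
    accumulated in the i earlier steps.
    """
    return sum(context + i * per_step for i in range(k))


def decoupled_prompt_tokens(context: int, per_step: int, k: int) -> int:
    """Prompt tokens with one planning call and one solving call.

    The planner sees only the context; the solver sees the context plus
    all k steps of plan and evidence, sent a single time.
    """
    planner_call = context
    solver_call = context + k * per_step
    return planner_call + solver_call


# Made-up sizes: a 1000-token context, 200 tokens per step, 3 steps.
interleaved = interleaved_prompt_tokens(context=1000, per_step=200, k=3)
decoupled = decoupled_prompt_tokens(context=1000, per_step=200, k=3)
print(interleaved, decoupled)
```

With these illustrative sizes the interleaved loop sends 3600 prompt tokens and the decoupled version 2600. Under this model the interleaved cost grows as k times the context plus a term quadratic in k, while the decoupled cost pays for the context only twice, so the gap widens as prompts get longer or steps more numerous. The numbers are not meant to reproduce the 5x figure reported for HotpotQA.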
Created on 07 Jun. 2023
