Successive Prompting for Decomposing Complex Questions

AI-generated keywords: Successive Prompting Question Decomposition Question Answering Synthetic Dataset DROP Dataset

AI-generated Key Points

  • Complex question answering with limited supervision is challenging
  • Large language models (LMs) can perform complex question answering in a few-shot setting
  • Previous approach requires manually written intermediate supervision, which is expensive to collect
  • Successive Prompting breaks down complex tasks into simple tasks and solves them iteratively
  • Decouples supervision for decomposing complex questions from the supervision for answering simple questions
  • Three types of model outputs: intermediate questions qk, intermediate answers ak, and the final answer y
  • Model alternates between QD and QA until it terminates
  • Successive prompting can be used with model fine-tuning using synthetic data
  • Best model achieved an improvement of approximately 5% absolute F1 on a few-shot version of the DROP dataset compared with state-of-the-art models with similar supervision.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Dheeru Dua, Shivanshu Gupta, Sameer Singh, Matt Gardner

License: CC BY 4.0

Abstract: Answering complex questions that require making latent decisions is a challenging task, especially when limited supervision is available. Recent works leverage the capabilities of large language models (LMs) to perform complex question answering in a few-shot setting by demonstrating how to output intermediate rationalizations while solving the complex question in a single pass. We introduce ``Successive Prompting'', where we iteratively break down a complex task into a simple task, solve it, and then repeat the process until we get the final solution. Successive prompting decouples the supervision for decomposing complex questions from the supervision for answering simple questions, allowing us to (1) have multiple opportunities to query in-context examples at each reasoning step (2) learn question decomposition separately from question answering, including using synthetic data, and (3) use bespoke (fine-tuned) components for reasoning steps where a large LM does not perform well. The intermediate supervision is typically manually written, which can be expensive to collect. We introduce a way to generate a synthetic dataset which can be used to bootstrap a model's ability to decompose and answer intermediate questions. Our best model (with successive prompting) achieves an improvement of ~5% absolute F1 on a few-shot version of the DROP dataset when compared with a state-of-the-art model with the same supervision.

Submitted to arXiv on 08 Dec. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2212.04092v1

The ability to answer complex questions that require making latent decisions is a challenging task, particularly when limited supervision is available. Recent works have utilized large language models (LMs) to perform complex question answering in a few-shot setting by outputting intermediate rationalizations while solving the complex question in a single pass. However, this approach requires manually written intermediate supervision, which can be expensive to collect. To address this issue, researchers have introduced "Successive Prompting," where they iteratively break down a complex task into simple tasks and solve them until they arrive at the final solution. This approach decouples the supervision for decomposing complex questions from the supervision for answering simple questions, allowing for multiple opportunities to query in-context examples at each reasoning step and learn question decomposition separately from question answering using synthetic data. There are three types of model outputs: intermediate questions qk, intermediate answers ak, and the final answer y. The first type of output is referred to as question decomposition (QD), while the second type is known as question answering (QA). Final answer prediction is treated as a special case of QD where the model decides that no more decomposition is necessary and outputs a final answer. The model iteratively alternates between QD and QA until it terminates. Successive prompting can also be used with model fine-tuning, where each intermediate output serves as training examples for LMs. The researchers generated a synthetic dataset that can bootstrap a model's ability to decompose and answer intermediate questions. Their best model achieved an improvement of approximately 5% absolute F1 on a few-shot version of the DROP dataset compared with state-of-the-art models with similar supervision. Overall, Successive Prompting provides an effective way to decompose complex questions into simpler ones while learning how to answer them separately using large language models or bespoke components when necessary.
Created on 28 May. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.