Successive Prompting for Decomposing Complex Questions

AI-generated keywords: Successive Prompting Question Decomposition Question Answering Synthetic Dataset DROP Dataset

AI-generated Key Points

Complex question answering with limited supervision is challenging
Large language models (LMs) can perform complex question answering in a few-shot setting
Previous approach requires manually written intermediate supervision, which is expensive to collect
Successive Prompting breaks down complex tasks into simple tasks and solves them iteratively
Decouples supervision for decomposing complex questions from the supervision for answering simple questions
Three types of model outputs: intermediate questions qk, intermediate answers ak, and the final answer y
Model alternates between QD and QA until it terminates
Successive prompting can be used with model fine-tuning using synthetic data
Best model achieved an improvement of approximately 5% absolute F1 on a few-shot version of the DROP dataset compared with state-of-the-art models with similar supervision.

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Dheeru Dua, Shivanshu Gupta, Sameer Singh, Matt Gardner

arXiv: 2212.04092v1 - DOI (cs.CL)

License: CC BY 4.0

Abstract: Answering complex questions that require making latent decisions is a challenging task, especially when limited supervision is available. Recent works leverage the capabilities of large language models (LMs) to perform complex question answering in a few-shot setting by demonstrating how to output intermediate rationalizations while solving the complex question in a single pass. We introduce ``Successive Prompting'', where we iteratively break down a complex task into a simple task, solve it, and then repeat the process until we get the final solution. Successive prompting decouples the supervision for decomposing complex questions from the supervision for answering simple questions, allowing us to (1) have multiple opportunities to query in-context examples at each reasoning step (2) learn question decomposition separately from question answering, including using synthetic data, and (3) use bespoke (fine-tuned) components for reasoning steps where a large LM does not perform well. The intermediate supervision is typically manually written, which can be expensive to collect. We introduce a way to generate a synthetic dataset which can be used to bootstrap a model's ability to decompose and answer intermediate questions. Our best model (with successive prompting) achieves an improvement of ~5% absolute F1 on a few-shot version of the DROP dataset when compared with a state-of-the-art model with the same supervision.

Submitted to arXiv on 08 Dec. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2212.04092v1

Comprehensive Summary
Key points
Layman's Summary
Blog article

The ability to answer complex questions that require making latent decisions is a challenging task, particularly when limited supervision is available. Recent works have utilized large language models (LMs) to perform complex question answering in a few-shot setting by outputting intermediate rationalizations while solving the complex question in a single pass. However, this approach requires manually written intermediate supervision, which can be expensive to collect. To address this issue, researchers have introduced "Successive Prompting," where they iteratively break down a complex task into simple tasks and solve them until they arrive at the final solution. This approach decouples the supervision for decomposing complex questions from the supervision for answering simple questions, allowing for multiple opportunities to query in-context examples at each reasoning step and learn question decomposition separately from question answering using synthetic data. There are three types of model outputs: intermediate questions qk, intermediate answers ak, and the final answer y. The first type of output is referred to as question decomposition (QD), while the second type is known as question answering (QA). Final answer prediction is treated as a special case of QD where the model decides that no more decomposition is necessary and outputs a final answer. The model iteratively alternates between QD and QA until it terminates. Successive prompting can also be used with model fine-tuning, where each intermediate output serves as training examples for LMs. The researchers generated a synthetic dataset that can bootstrap a model's ability to decompose and answer intermediate questions. Their best model achieved an improvement of approximately 5% absolute F1 on a few-shot version of the DROP dataset compared with state-of-the-art models with similar supervision. Overall, Successive Prompting provides an effective way to decompose complex questions into simpler ones while learning how to answer them separately using large language models or bespoke components when necessary.

- Complex question answering with limited supervision is challenging
- Large language models (LMs) can perform complex question answering in a few-shot setting
- Previous approach requires manually written intermediate supervision, which is expensive to collect
- Successive Prompting breaks down complex tasks into simple tasks and solves them iteratively
- Decouples supervision for decomposing complex questions from the supervision for answering simple questions
- Three types of model outputs: intermediate questions qk, intermediate answers ak, and the final answer y
- Model alternates between QD and QA until it terminates
- Successive prompting can be used with model fine-tuning using synthetic data
- Best model achieved an improvement of approximately 5% absolute F1 on a few-shot version of the DROP dataset compared with state-of-the-art models with similar supervision.

"Scientists are trying to teach computers how to answer complex questions with limited help. They use big computer programs called Large Language Models (LMs) to do this. Before, people had to give the computer lots of help, which was hard and expensive. Now, they use a method called Successive Prompting that breaks down hard questions into easier ones for the computer to solve step by step. The computer gives three types of answers: intermediate questions, intermediate answers, and the final answer. This method helps the computer learn better and faster." Definitions- Complex question answering: when a machine tries to understand and answer difficult questions - Limited supervision: when a machine is given only a little bit of help or guidance - Large language models (LMs): big computer programs that can understand and interpret human language - Intermediate supervision: giving the machine guidance or feedback during its learning process - Successive Prompting: breaking down complex tasks into simpler ones for easier understanding

Breaking Down Complex Questions with Successive Prompting

Solving complex questions that require making decisions is a difficult task, especially when limited supervision is available. Recent works have used large language models (LMs) to answer complex questions in a few-shot setting by outputting intermediate rationalizations while solving the question in one pass. However, this approach requires manually written intermediate supervision which can be costly to obtain. To address this issue, researchers have introduced "Successive Prompting," where they break down a complex task into simpler tasks and solve them until they reach the final solution.

What is Successive Prompting?

Successive prompting decouples the supervision for decomposing complex questions from the supervision for answering simple questions, allowing multiple opportunities to query in-context examples at each reasoning step and learn question decomposition separately from question answering using synthetic data. The model outputs three types of information: intermediate questions qk, intermediate answers ak, and the final answer y. Question decomposition (QD) refers to the first type of output while question answering (QA) refers to the second type of output. Final answer prediction is treated as a special case of QD where no more decomposition is necessary and an answer is provided instead. The model iterates between QD and QA until it terminates.

Model Fine-Tuning with Synthetic Data

Successive prompting can also be used with model fine-tuning where each intermediate output serves as training examples for LMs. Researchers generated a synthetic dataset that can bootstrap a model's ability to decompose and answer intermediate questions accurately without manual intervention or expensive annotation costs associated with supervised learning methods like human labeling or crowdsourcing services like Amazon Mechanical Turk (AMT). Their best model achieved an improvement of approximately 5% absolute F1 on a few-shot version of the DROP dataset compared with state-of-the-art models using similar supervision techniques such as AMT or human labeling approaches which are both expensive methods for obtaining data annotations.

Conclusion

In conclusion, successive prompting provides an effective way to break down complex questions into simpler ones while learning how to answer them separately using large language models or bespoke components when necessary without having to rely on expensive annotation costs associated with supervised learning methods like human labeling or crowdsourcing services like Amazon Mechanical Turk (AMT). This technique has been shown to improve accuracy on datasets such as DROP by up to 5% absolute F1 score compared with existing state-of-the art models utilizing similar techniques such as AMT or human labeling approaches

Created on 28 May. 2023

Assess the quality of the AI-generated content by voting

Score: 0

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

63.0%

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

cs.CL

60.6%

Answering Questions by Meta-Reasoning over Multiple Chains of Thought

cs.CL

60.3%

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in N…

cs.CL

60.2%

Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by L…

cs.CL

58.9%

An automatically discovered chain-of-thought prompt generalizes to novel mode…

cs.CL

58.4%

ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language …

cs.CL

58.2%

Question Generation for Adaptive Education

cs.CL

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.