Self-Refine: Iterative Refinement with Self-Feedback
AI-generated Key Points
- Framework called SELF-REFINE introduced to improve initial outputs of large language models (LLMs)
- SELF-REFINE generates output using LLM and allows the model to provide multi-aspect feedback for its own output
- Model refines previously generated output based on its own feedback
- Does not require supervised training data or reinforcement learning, works with a single LLM
- Experimented with seven diverse tasks, including review rewriting and math reasoning
- Compared outputs generated with SELF-REFINE to those generated directly with GPT-3.5 and GPT-4
- Outputs generated with SELF-REFINE preferred by humans and automated metrics across all tasks, improving on average by 20%
- Related work section discusses use of human and machine-generated natural language feedback in various tasks
- Different sources of feedback explored, including humans, reinforcement learning approaches, automated sources like compilers or online sources such as Wikipedia edits, and LLMs themselves
- Feedback can be in natural language or non-natural language forms
- Presents a novel framework for improving LLM outputs through iterative refinement using self-feedback
Authors: Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, Sean Welleck, Bodhisattwa Prasad Majumder, Shashank Gupta, Amir Yazdanbakhsh, Peter Clark
Abstract: Like people, LLMs do not always generate the best text for a given generation problem on their first try (e.g., summaries, answers, explanations). Just as people then refine their text, we introduce SELF-REFINE, a framework for similarly improving initial outputs from LLMs through iterative feedback and refinement. The main idea is to generate an output using an LLM, then allow the same model to provide multi-aspect feedback for its own output; finally, the same model refines its previously generated output given its own feedback. Unlike earlier work, our iterative refinement framework does not require supervised training data or reinforcement learning, and works with a single LLM. We experiment with 7 diverse tasks, ranging from review rewriting to math reasoning, demonstrating that our approach outperforms direct generation. In all tasks, outputs generated with SELF-REFINE are preferred by humans and by automated metrics over those generated directly with GPT-3.5 and GPT-4, improving on average by absolute 20% across tasks.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.