Self-Refine: Iterative Refinement with Self-Feedback

AI-generated keywords: SELF-REFINE LLMs Feedback Refinement Tasks

AI-generated Key Points

  • Framework called SELF-REFINE introduced to improve initial outputs of large language models (LLMs)
  • SELF-REFINE generates output using LLM and allows the model to provide multi-aspect feedback for its own output
  • Model refines previously generated output based on its own feedback
  • Does not require supervised training data or reinforcement learning, works with a single LLM
  • Experimented with seven diverse tasks, including review rewriting and math reasoning
  • Compared outputs generated with SELF-REFINE to those generated directly with GPT-3.5 and GPT-4
  • Outputs generated with SELF-REFINE preferred by humans and automated metrics across all tasks, improving on average by 20%
  • Related work section discusses use of human and machine-generated natural language feedback in various tasks
  • Different sources of feedback explored, including humans, reinforcement learning approaches, automated sources like compilers or online sources such as Wikipedia edits, and LLMs themselves
  • Feedback can be in natural language or non-natural language forms
  • Presents a novel framework for improving LLM outputs through iterative refinement using self-feedback
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, Sean Welleck, Bodhisattwa Prasad Majumder, Shashank Gupta, Amir Yazdanbakhsh, Peter Clark

Code, data, and demo at https://selfrefine.info/
License: CC BY 4.0

Abstract: Like people, LLMs do not always generate the best text for a given generation problem on their first try (e.g., summaries, answers, explanations). Just as people then refine their text, we introduce SELF-REFINE, a framework for similarly improving initial outputs from LLMs through iterative feedback and refinement. The main idea is to generate an output using an LLM, then allow the same model to provide multi-aspect feedback for its own output; finally, the same model refines its previously generated output given its own feedback. Unlike earlier work, our iterative refinement framework does not require supervised training data or reinforcement learning, and works with a single LLM. We experiment with 7 diverse tasks, ranging from review rewriting to math reasoning, demonstrating that our approach outperforms direct generation. In all tasks, outputs generated with SELF-REFINE are preferred by humans and by automated metrics over those generated directly with GPT-3.5 and GPT-4, improving on average by absolute 20% across tasks.

Submitted to arXiv on 30 Mar. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2303.17651v1

In this paper, the authors introduce a framework called SELF-REFINE which aims to improve the initial outputs generated by large language models (LLMs) through iterative feedback and refinement. The main idea is to generate an output using an LLM and then allow the same model to provide multi-aspect feedback for its own output. The model then refines its previously generated output based on its own feedback. Unlike previous approaches, SELF-REFINE does not require supervised training data or reinforcement learning and works with a single LLM. The authors experiment with seven diverse tasks, including review rewriting and math reasoning, to demonstrate the effectiveness of their approach. They compare the outputs generated with SELF-REFINE to those generated directly with GPT-3.5 and GPT-4. The results show that outputs generated with SELF-REFINE are preferred by both humans and automated metrics across all tasks, improving on average by 20% compared to direct generation. The related work section discusses the use of human- and machine-generated natural language feedback in various tasks such as summarization, script generation, program synthesis, computer vision, and others. Different sources of feedback are explored including humans, reinforcement learning based approaches, automated sources like compilers or online sources such as Wikipedia edits and LLMs themselves. The representation of feedback can be in natural language or non-natural language forms. Overall this paper presents a novel framework for improving LLM outputs through iterative refinement using self-feedback. The experimental results demonstrate the effectiveness of this approach across diverse tasks.
Created on 30 Sep. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.