EXPEREPAIR: Dual-Memory Enhanced LLM-based Repository-Level Program Repair

AI-generated keywords: Automatic Program Repair; Large Language Models; Dual Memory Systems; Software Engineering; AI

AI-generated Key Points

  • Automatically repairing software issues is a persistent challenge at the intersection of software engineering and AI.
  • Recent advancements in Large Language Models (LLMs) have shown promise for repository-level repair tasks, but current methodologies have limitations.
  • Current approaches often address issues in isolation without leveraging insights from past problem resolutions and rely on static prompting strategies that hinder adaptability.
  • ExpeRepair is a novel LLM-based approach, inspired by the dual memory systems of human cognition, that continuously learns from historical repair experiences through dual-channel knowledge accumulation.
  • ExpeRepair organizes past repair experiences into episodic memory storing concrete repair demonstrations and semantic memory encoding abstract reflective insights.
  • During inference, ExpeRepair activates both memory systems to retrieve relevant demonstrations and high-level repair insights for context-aware prompt generation based on experience.
  • Experiments show that ExpeRepair achieves a pass@1 score of 49.3% with Claude 3.7 Sonnet, outperforming all state-of-the-art open-source methods on the SWE-bench Lite benchmark.
  • By incorporating historical repair experiences into repository-level Automatic Program Repair (APR), ExpeRepair aims to emulate how developers learn, adapt, and improve over time within a software project.
  • This innovative approach draws inspiration from cognitive science's dual-memory system model and offers a promising direction for enhancing software issue resolution in real-world development scenarios.

Authors: Fangwen Mu, Junjie Wang, Lin Shi, Song Wang, Shoubin Li, Qing Wang

License: CC BY 4.0

Abstract: Automatically repairing software issues remains a fundamental challenge at the intersection of software engineering and AI. Although recent advancements in Large Language Models (LLMs) have demonstrated potential for repository-level repair tasks, current methodologies exhibit two notable limitations: (1) they often address issues in isolation, neglecting to incorporate insights from previously resolved issues, and (2) they rely on static and rigid prompting strategies, which constrain their ability to generalize across diverse and evolving issue scenarios. Inspired by the dual memory systems of human cognition, where episodic and semantic memories work synergistically to support human reasoning and decision-making, we propose ExpeRepair, a novel LLM-based approach that continuously learns from historical repair experiences through dual-channel knowledge accumulation. ExpeRepair organizes historical repair experiences into two complementary memories: an episodic memory that stores concrete repair demonstrations, and a semantic memory that encodes abstract reflective insights. At inference time, ExpeRepair activates both memory systems by retrieving relevant demonstrations from episodic memory and recalling high-level repair insights from semantic memory. It further enhances adaptability through dynamic prompt composition, synergistically integrating both memory types to replace static prompts with context-aware, experience-driven prompts. Experiments on the SWE-bench Lite benchmark demonstrate that ExpeRepair achieves a pass@1 score of 49.3% with Claude 3.7 Sonnet, outperforming all state-of-the-art open-source methods.
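The inference-time flow described in the abstract (retrieve demonstrations from episodic memory, recall insights from semantic memory, compose a prompt from both) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the class and function names (`Demonstration`, `DualMemory`, `retrieve_demonstrations`, `compose_prompt`), the token-overlap (Jaccard) similarity, and the prompt layout are all hypothetical choices made for the example. The abstract does not specify how retrieval or prompt assembly is actually performed.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class Demonstration:
    """One concrete past repair (an episodic memory entry). Fields are hypothetical."""
    issue_text: str
    patch: str


@dataclass
class DualMemory:
    """Hypothetical container for the two memories named in the abstract."""
    episodic: List[Demonstration] = field(default_factory=list)  # concrete demonstrations
    semantic: List[str] = field(default_factory=list)            # abstract insights


def _tokens(text: str) -> Set[str]:
    """Lowercased whitespace tokens; a deliberately simple stand-in for a real encoder."""
    return set(text.lower().split())


def jaccard(a: str, b: str) -> float:
    """Token-overlap similarity in [0, 1]. Assumed metric, not taken from the paper."""
    ta, tb = _tokens(a), _tokens(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def retrieve_demonstrations(memory: DualMemory, issue: str, k: int = 2) -> List[Demonstration]:
    """Return the k stored demonstrations whose issue text is most similar to the new issue."""
    ranked = sorted(
        memory.episodic,
        key=lambda d: jaccard(d.issue_text, issue),
        reverse=True,
    )
    return ranked[:k]


def compose_prompt(memory: DualMemory, issue: str, k: int = 2) -> str:
    """Build a prompt from both memories instead of using a fixed template."""
    parts = ["## Insights from past repairs"]
    parts += [f"- {insight}" for insight in memory.semantic]
    parts.append("## Similar resolved issues")
    for demo in retrieve_demonstrations(memory, issue, k):
        parts.append(f"Issue: {demo.issue_text}\nPatch:\n{demo.patch}")
    parts.append("## Current issue")
    parts.append(issue)
    return "\n\n".join(parts)
```

In this sketch the prompt changes as the memories grow: each stored demonstration or insight can alter what a later issue retrieves, which is the contrast with a static prompt that the abstract draws.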

Submitted to arXiv on 12 Jun. 2025

Results of the summarizing process for the arXiv paper: 2506.10484v1

Automatically repairing software issues is a persistent challenge at the intersection of software engineering and AI. Recent advancements in Large Language Models (LLMs) have shown promise for repository-level repair tasks, but current methodologies have two limitations. They often address issues in isolation, without leveraging insights from previously resolved issues, and they rely on static prompting strategies that hinder their ability to adapt to diverse and evolving scenarios. To address these challenges, the authors introduce ExpeRepair, a novel LLM-based approach inspired by the dual memory systems of human cognition. The method continuously learns from historical repair experiences through dual-channel knowledge accumulation. It organizes past repair experiences into two complementary memories: an episodic memory storing concrete repair demonstrations and a semantic memory encoding abstract reflective insights. During inference, ExpeRepair activates both memory systems by retrieving relevant demonstrations from the episodic memory and recalling high-level repair insights from the semantic memory. It enhances adaptability through dynamic prompt composition, integrating both memory types to generate context-aware, experience-driven prompts in place of static ones. Experiments on the SWE-bench Lite benchmark show that ExpeRepair achieves a pass@1 score of 49.3% with Claude 3.7 Sonnet, outperforming all state-of-the-art open-source methods. By explicitly incorporating historical repair experiences into repository-level Automatic Program Repair (APR), ExpeRepair aims to emulate how developers learn, adapt, and improve over time within a software project. In conclusion, the approach draws on the dual-memory model from cognitive science and offers a promising direction for improving software issue resolution in real-world development scenarios.
Created on 14 Jul. 2025
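
The pass@1 score reported in the summary can be read, under the common single-attempt interpretation, as the percentage of benchmark issues whose one submitted patch resolves the issue. The sketch below assumes exactly that interpretation. The function name `pass_at_1` and the outcome list are hypothetical and purely illustrative; the outcomes are not data from the paper.

```python
from typing import List


def pass_at_1(resolved: List[bool]) -> float:
    """Percentage of issues resolved by the single patch submitted for each issue.

    `resolved[i]` is True when the one submitted patch for issue i passes
    the benchmark's checks. Assumes one attempt per issue.
    """
    if not resolved:
        raise ValueError("at least one issue outcome is required")
    return 100.0 * sum(resolved) / len(resolved)


# Made-up outcomes for four issues (not results from the paper).
example_outcomes = [True, False, True, False]
score = pass_at_1(example_outcomes)
```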
