Conflict-Aware Soft Prompting for Retrieval-Augmented Generation

AI-generated keywords: Conflict-Aware Retrieval-Augmented Generation Context-Memory Conflicts Large Language Models Soft Prompting Grounded/Adversarial Training

AI-generated Key Points

  • Introduction of Conflict-Aware Retrieval-Augmented Generation (CARE) approach
  • Integration of context assessor and base LLM to handle context-memory conflicts
  • Training of context assessor through grounded/adversarial soft prompting to identify unreliable context
  • Mechanism to direct reasoning process towards correct parametric knowledge
  • Outperformance of existing methods by CARE in QA and fact-checking benchmarks with 5.0% performance gain
  • Effectiveness of CARE in discerning conflicting knowledge and guiding base LLM towards accurate responses shown through t-SNE visualization
  • Limitations include focusing on top-1 retrieved passages and single-step decoding, but represents a significant advancement in developing trustworthy and adaptive RAG systems
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Eunseong Choi, June Park, Hyeri Lee, Jongwuk Lee

Accepted to EMNLP 2025; 14 pages; 5 figures, 11 tables
License: CC BY-NC-SA 4.0

Abstract: Retrieval-augmented generation (RAG) enhances the capabilities of large language models (LLMs) by incorporating external knowledge into their input prompts. However, when the retrieved context contradicts the LLM's parametric knowledge, it often fails to resolve the conflict between incorrect external context and correct parametric knowledge, known as context-memory conflict. To tackle this problem, we introduce Conflict-Aware REtrieval-Augmented Generation (CARE), consisting of a context assessor and a base LLM. The context assessor encodes compact memory token embeddings from raw context tokens. Through grounded/adversarial soft prompting, the context assessor is trained to discern unreliable context and capture a guidance signal that directs reasoning toward the more reliable knowledge source. Extensive experiments show that CARE effectively mitigates context-memory conflicts, leading to an average performance gain of 5.0\% on QA and fact-checking benchmarks, establishing a promising direction for trustworthy and adaptive RAG systems.

Submitted to arXiv on 21 Aug. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2508.15253v1

The paper "Conflict-Aware Soft Prompting for Retrieval-Augmented Generation" introduces a novel approach called Conflict-Aware Retrieval-Augmented Generation (CARE) to address the issue of context-memory conflicts in large language models (LLMs) enhanced with retrieval-augmented generation (RAG). The core idea behind CARE is the integration of a context assessor and a base LLM to effectively handle conflicting external knowledge retrieved during the generation process. The context assessor in CARE encodes memory token embeddings from raw context tokens and is trained through grounded/adversarial soft prompting to identify unreliable context and provide guidance signals for more reliable reasoning. This mechanism helps direct the reasoning process towards the correct parametric knowledge, mitigating context-memory conflicts effectively. Extensive experiments conducted on QA and fact-checking benchmarks demonstrate that CARE outperforms existing methods, leading to an average performance gain of 5.0%. The qualitative analysis using t-SNE visualization showcases the effectiveness of CARE in discerning conflicting knowledge and guiding the base LLM towards accurate responses. However, despite its limitations such as focusing solely on top-1 retrieved passages and single-step decoding, CARE represents a significant advancement in developing trustworthy and adaptive RAG systems by addressing context-memory conflicts effectively.
Created on 28 Feb. 2026

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.