Graph-Guided Concept Selection for Efficient Retrieval-Augmented Generation

AI-generated keywords: Graph-Based RAG

AI-generated Key Points

  • The Graph-Based RAG framework enhances retrieval and question answering in Large Language Model (LLM) systems by constructing knowledge graphs (KG) from text chunks.
  • G2ConS is a new approach that optimizes KG construction costs while maintaining retrieval effectiveness and answering quality.
  • G2ConS incorporates a chunk selection method that reduces the overall cost of KG construction, and an LLM-independent concept graph that fills the resulting knowledge gaps at no additional LLM cost.
  • G2ConS outperforms existing methods like GraphRAG, HippoRAG, LightRAG, KAG, FastRAG, and GraphReader in terms of construction cost efficiency, retrieval effectiveness, and answering quality across multiple real-world datasets.
  • G2ConS emphasizes concept selection in graph construction to achieve consistent improvements in both cost efficiency and performance compared to traditional methods.

Authors: Ziyu Liu, Yijing Liu, Jianfei Yuan, Minzhi Yan, Le Yue, Honghui Xiong, Yi Yang

License: CC BY 4.0

Abstract: Graph-based RAG constructs a knowledge graph (KG) from text chunks to enhance retrieval in Large Language Model (LLM)-based question answering. It is especially beneficial in domains such as biomedicine, law, and political science, where effective retrieval often involves multi-hop reasoning over proprietary documents. However, these methods demand numerous LLM calls to extract entities and relations from text chunks, incurring prohibitive costs at scale. Through a carefully designed ablation study, we observe that certain words (termed concepts) and their associated documents are more important. Based on this insight, we propose Graph-Guided Concept Selection (G2ConS). Its core comprises a chunk selection method and an LLM-independent concept graph. The former selects salient document chunks to reduce KG construction costs; the latter closes knowledge gaps introduced by chunk selection at zero cost. Evaluations on multiple real-world datasets show that G2ConS outperforms all baselines in construction cost, retrieval effectiveness, and answering quality.
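The two components named in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not specify how concepts are extracted or how chunks are ranked, so the stopword-filtered tokenizer, the co-occurrence edge weights, and the degree-based chunk score below are all assumptions chosen for illustration, and every function name is hypothetical.

```python
import re
from collections import Counter
from itertools import combinations

# Assumption: a tiny stopword list stands in for real concept filtering.
STOPWORDS = {"the", "a", "an", "of", "in", "and", "to", "is", "for", "on", "with", "by"}


def extract_concepts(chunk):
    """Return the set of lowercased words (length > 2, non-stopword) in a chunk.

    Hypothetical stand-in for concept extraction; no LLM call is involved.
    """
    words = re.findall(r"[a-z]+", chunk.lower())
    return {w for w in words if w not in STOPWORDS and len(w) > 2}


def build_concept_graph(chunks):
    """Build a co-occurrence graph as a Counter keyed by sorted concept pairs.

    The weight of edge (a, b) is the number of chunks containing both concepts.
    """
    edges = Counter()
    for chunk in chunks:
        for a, b in combinations(sorted(extract_concepts(chunk)), 2):
            edges[(a, b)] += 1
    return edges


def select_chunks(chunks, budget):
    """Return indices of the `budget` chunks whose concepts are best connected.

    Assumption: a chunk's salience is the summed weighted degree of its concepts.
    Only the selected chunks would then be sent to LLM-based KG extraction.
    """
    degree = Counter()
    for (a, b), weight in build_concept_graph(chunks).items():
        degree[a] += weight
        degree[b] += weight
    scored = [
        (sum(degree[c] for c in extract_concepts(chunk)), index)
        for index, chunk in enumerate(chunks)
    ]
    top = sorted(scored, key=lambda item: (-item[0], item[1]))[:budget]
    return sorted(index for _, index in top)


if __name__ == "__main__":
    corpus = [
        "Aspirin inhibits platelet aggregation",
        "Platelet aggregation causes clotting",
        "Weather today sunny",
    ]
    print(select_chunks(corpus, budget=2))
```

In this sketch the concept graph is built over every chunk, while only the chunks returned by `select_chunks` would incur LLM extraction cost; that split mirrors the cost argument in the abstract but the scoring rule itself is illustrative only.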

Submitted to arXiv on 28 Oct. 2025


Results of the summarizing process for the arXiv paper: 2510.24120v1

Graph-based RAG enhances retrieval and question answering in Large Language Model (LLM) systems by constructing a knowledge graph (KG) from text chunks. The approach is particularly beneficial in domains such as biomedicine, law, and political science, where effective retrieval often requires multi-hop reasoning over proprietary documents. However, its reliance on numerous LLM calls to extract entities and relations from text chunks incurs prohibitive costs at scale.

To address this challenge, the authors propose Graph-Guided Concept Selection (G2ConS). G2ConS combines a chunk selection method with an LLM-independent concept graph to reduce KG construction costs while maintaining retrieval effectiveness and answering quality. The chunk selection method identifies salient document chunks so that fewer chunks pass through LLM-based extraction, and the concept graph fills the knowledge gaps introduced by chunk selection without additional LLM cost.

Compared with existing methods such as GraphRAG (Edge et al., 2024), HippoRAG (Jimenez Gutierrez et al., 2024), LightRAG (Guo et al., 2024), KAG (Liang et al., 2024), FastRAG (Abane et al., 2024), and GraphReader (Li et al., 2024b), G2ConS is reported to achieve better construction cost efficiency, retrieval effectiveness, and answering quality across multiple real-world datasets. It performs best when the KG and the concept graph are combined in a hybrid retrieval strategy, and it remains compatible with mainstream GraphRAG approaches.

Earlier efforts to improve RAG on multi-hop reasoning tasks through KG construction have been held back by high construction costs. Approaches such as LightRAG (Guo et al., 2024) and HiRAG (Huang et al., 2025a) simplify the KG construction process but may lose accuracy on complex tasks. In contrast, G2ConS focuses on concept selection during graph construction and reports consistent improvements in both cost efficiency and performance over these methods. Overall, G2ConS makes KG-based RAG cheaper to build for retrieval and question answering across diverse domains without sacrificing answer quality.
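The hybrid retrieval strategy mentioned above can be sketched as a simple score fusion. The summary does not say how G2ConS merges results from the KG and the concept graph, so the min-max normalization, the weighted sum, and the `alpha` parameter below are assumptions, and `normalize` and `hybrid_rank` are hypothetical names.

```python
def normalize(scores):
    """Min-max scale a {chunk_id: score} dict to [0, 1].

    If all scores are equal, every chunk maps to 1.0.
    """
    if not scores:
        return {}
    low, high = min(scores.values()), max(scores.values())
    if high == low:
        return {chunk_id: 1.0 for chunk_id in scores}
    return {chunk_id: (s - low) / (high - low) for chunk_id, s in scores.items()}


def hybrid_rank(kg_scores, concept_scores, alpha=0.5, top_k=3):
    """Fuse two retrievers' chunk scores and return the top_k chunk ids.

    Assumption: fused score = alpha * KG score + (1 - alpha) * concept-graph score,
    with a chunk missing from one retriever contributing 0 from that side.
    Ties are broken by chunk id so the ordering is deterministic.
    """
    kg = normalize(kg_scores)
    cg = normalize(concept_scores)
    fused = {
        chunk_id: alpha * kg.get(chunk_id, 0.0) + (1 - alpha) * cg.get(chunk_id, 0.0)
        for chunk_id in set(kg) | set(cg)
    }
    return sorted(fused, key=lambda chunk_id: (-fused[chunk_id], chunk_id))[:top_k]


if __name__ == "__main__":
    kg_hits = {"c1": 0.9, "c2": 0.1}       # hypothetical KG retriever scores
    concept_hits = {"c2": 5.0, "c3": 1.0}  # hypothetical concept-graph scores
    print(hybrid_rank(kg_hits, concept_hits, alpha=0.5, top_k=2))
```

Because chunks dropped during chunk selection never enter the KG, they can only surface through the concept-graph side of the fusion; a smaller `alpha` gives those chunks more weight.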
Created on 21 Feb. 2026

