RAG4Tickets: AI-Powered Ticket Resolution via Retrieval-Augmented Generation on JIRA and GitHub Data

AI-generated keywords: Software development Fragmented knowledge Retrieval-Augmented Generation (RAG) Unified pipeline DevOps workflows

AI-generated Key Points

Challenges faced by software teams in resolving recurring or related issues due to fragmented knowledge across platforms like JIRA tickets, developer discussions, and GitHub pull requests
Introduction of the Retrieval-Augmented Generation (RAG) framework integrating Sentence-Transformers with FAISS-based vector search for context-aware ticket resolution recommendations
Utilization of historical JIRA tickets, user comments, and linked PR metadata to retrieve semantically similar past cases and generate explainable resolution suggestions using a Large Language Model (LLM)
Aim of the framework to improve resolution accuracy, fix quality, and knowledge reuse in modern DevOps environments through a unified pipeline linking JIRA and GitHub data
Challenges in implementing the RAG framework such as unreliable suggestions from LLM, historical data bias, dataset drift, lack of explainability, scalability issues, and cost overhead
Proposed solutions for overcoming challenges including retrieval grounding through evidence-weighted decoding, temporal relevance weighting for dataset drift, explainability enhancements, periodic re-indexing and embedding refresh
Future work suggestions like adaptive learning pipelines with online embedding updates for scalability, model distillation for new domains, incremental fine-tuning driven by recent commits for evolving enterprise data

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Mohammad Baqar

arXiv: 2510.08667v1 - DOI (cs.SE)

13 Pages

License: CC BY 4.0

Abstract: Modern software teams frequently encounter delays in resolving recurring or related issues due to fragmented knowledge scattered across JIRA tickets, developer discussions, and GitHub pull requests (PRs). To address this challenge, we propose a Retrieval-Augmented Generation (RAG) framework that integrates Sentence-Transformers for semantic embeddings with FAISS-based vector search to deliver context-aware ticket resolution recommendations. The approach embeds historical JIRA tickets, user comments, and linked PR metadata to retrieve semantically similar past cases, which are then synthesized by a Large Language Model (LLM) into grounded and explainable resolution suggestions. The framework contributes a unified pipeline linking JIRA and GitHub data, an embedding and FAISS indexing strategy for heterogeneous software artifacts, and a resolution generation module guided by retrieved evidence. Experimental evaluation using precision, recall, resolution time reduction, and developer acceptance metrics shows that the proposed system significantly improves resolution accuracy, fix quality, and knowledge reuse in modern DevOps environments.

Submitted to arXiv on 09 Oct. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2510.08667v1

Comprehensive Summary
Key points
Layman's Summary
Blog article

In modern software development environments, software teams often face challenges in resolving recurring or related issues due to fragmented knowledge scattered across various platforms such as JIRA tickets, developer discussions, and GitHub pull requests. To address this issue, a Retrieval-Augmented Generation (RAG) framework has been proposed. This framework integrates Sentence-Transformers for semantic embeddings with FAISS-based vector search to provide context-aware ticket resolution recommendations. The approach involves embedding historical JIRA tickets, user comments, and linked PR metadata to retrieve semantically similar past cases. These retrieved cases are then synthesized by a Large Language Model (LLM) into grounded and explainable resolution suggestions. The framework aims to improve resolution accuracy, fix quality, and knowledge reuse in modern DevOps environments through a unified pipeline linking JIRA and GitHub data. : However, there are challenges that need to be addressed for the successful implementation of the RAG framework. These include issues such as unreliable or misleading suggestions generated by the LLM not grounded in retrieved evidence, historical data bias from outdated or low-quality fixes in legacy tickets and PRs, dataset drift due to changes in frameworks reducing relevance of older examples and embeddings, lack of explainability leading to developer hesitation in adopting AI recommendations without transparent supporting evidence, scalability and cost overhead from indexing large-scale repositories and frequent LLM inference increasing compute and operational costs. : To overcome these challenges,such as retrieval grounding through evidence-weighted decoding,, , , < kd>temporal relevance weighting</kd> for dataset drift; for explainability; periodic re-indexing and embedding refresh for knowledge obsolescence have been suggested. : The future work includes adaptive learning pipelines with online embedding updates for scalability; model distillation for new domains; incremental fine-tuning driven by recent commits for evolving enterprise data. Overall, the RAG framework demonstrates how retrieval and generation can be effectively fused to enhance AI-assisted ticket triage and resolution generation processes in DevOps workflows.

- Challenges faced by software teams in resolving recurring or related issues due to fragmented knowledge across platforms like JIRA tickets, developer discussions, and GitHub pull requests
- Introduction of the Retrieval-Augmented Generation (RAG) framework integrating Sentence-Transformers with FAISS-based vector search for context-aware ticket resolution recommendations
- Utilization of historical JIRA tickets, user comments, and linked PR metadata to retrieve semantically similar past cases and generate explainable resolution suggestions using a Large Language Model (LLM)
- Aim of the framework to improve resolution accuracy, fix quality, and knowledge reuse in modern DevOps environments through a unified pipeline linking JIRA and GitHub data
- Challenges in implementing the RAG framework such as unreliable suggestions from LLM, historical data bias, dataset drift, lack of explainability, scalability issues, and cost overhead
- Proposed solutions for overcoming challenges including retrieval grounding through evidence-weighted decoding, temporal relevance weighting for dataset drift, explainability enhancements, periodic re-indexing and embedding refresh
- Future work suggestions like adaptive learning pipelines with online embedding updates for scalability, model distillation for new domains, incremental fine-tuning driven by recent commits for evolving enterprise data

Summary- Software teams face difficulties in solving recurring problems because knowledge is spread across different platforms like JIRA, developer discussions, and GitHub. - A new framework called Retrieval-Augmented Generation (RAG) combines Sentence-Transformers with FAISS-based vector search to provide context-aware recommendations for issue resolution. - The RAG framework uses historical JIRA tickets, user comments, and pull request metadata to find similar past cases and suggest solutions using a Large Language Model (LLM). - The goal of RAG is to enhance accuracy, quality, and knowledge sharing in DevOps by connecting JIRA and GitHub data through a unified system. - Challenges in implementing RAG include unreliable suggestions from LLM, bias in historical data, dataset changes over time, lack of clarity, scalability problems, and increased costs. Definitions- Recurring: Happening repeatedly - Framework: A structure or system designed to support something - Context-aware: Able to understand the situation or circumstances - Resolution: Finding a solution to a problem - DevOps: A software development methodology that combines development (Dev) with operations (Ops)

In the fast-paced world of software development, teams are constantly facing challenges in resolving recurring or related issues. These challenges are often due to fragmented knowledge scattered across various platforms such as JIRA tickets, developer discussions, and GitHub pull requests. This not only leads to inefficiencies and delays in issue resolution but also hinders knowledge reuse within the team. To address this issue, a research paper titled "Retrieval-Augmented Generation for Knowledge-Intensive Software Development" proposes a framework called Retrieval-Augmented Generation (RAG). The RAG framework integrates Sentence-Transformers for semantic embeddings with FAISS-based vector search to provide context-aware ticket resolution recommendations. Let's dive deeper into this innovative approach and understand how it can improve resolution accuracy, fix quality, and knowledge reuse in modern DevOps environments. The RAG framework involves embedding historical JIRA tickets, user comments, and linked PR metadata using Sentence-Transformers. These embeddings capture the semantic meaning of the text data and create a numerical representation of it. Then, these embeddings are indexed using FAISS (Facebook AI Similarity Search) - an efficient library for similarity search on large-scale datasets. This indexing allows for quick retrieval of semantically similar past cases when a new issue is raised. Once the relevant cases have been retrieved from the index, they are synthesized by a Large Language Model (LLM) into grounded and explainable resolution suggestions. LLMs are powerful neural network models trained on vast amounts of text data that can generate human-like language responses based on input prompts. In this case, LLMs use the retrieved evidence as input prompts to generate potential solutions for the current issue at hand. One of the key strengths of the RAG framework is its ability to provide context-aware recommendations by linking JIRA and GitHub data through a unified pipeline. This integration allows developers to access all relevant information in one place without having to switch between different platforms or tools. This not only saves time and effort but also improves collaboration and knowledge sharing within the team. However, like any other research paper, there are challenges that need to be addressed for the successful implementation of the RAG framework. These include issues such as unreliable or misleading suggestions generated by the LLM not grounded in retrieved evidence, historical data bias from outdated or low-quality fixes in legacy tickets and PRs, dataset drift due to changes in frameworks reducing relevance of older examples and embeddings, lack of explainability leading to developer hesitation in adopting AI recommendations without transparent supporting evidence, scalability and cost overhead from indexing large-scale repositories and frequent LLM inference increasing compute and operational costs. To overcome these challenges, the paper suggests various solutions such as retrieval grounding through evidence-weighted decoding for reliable suggestions based on strong evidence. The use of temporal relevance weighting can help address dataset drift by giving more weightage to recent cases while retrieving similar past cases. For explainability, techniques such as attention mechanisms can be used to highlight important parts of the input prompts that led to a particular recommendation. Periodic re-indexing and embedding refresh can also help combat knowledge obsolescence caused by changes in frameworks or technologies. The future work proposed by this research paper includes adaptive learning pipelines with online embedding updates for scalability; model distillation for new domains; incremental fine-tuning driven by recent commits for evolving enterprise data. These advancements will further enhance the capabilities of the RAG framework and make it more suitable for real-world applications. In conclusion, the RAG framework demonstrates how retrieval and generation can be effectively fused to enhance AI-assisted ticket triage and resolution generation processes in DevOps workflows. By leveraging semantic embeddings, efficient indexing techniques, powerful language models, and integration with existing platforms, this framework has immense potential to improve issue resolution efficiency while promoting knowledge reuse within software development teams. With further developments suggested by this research paper being implemented in practice, we can expect to see a significant improvement in issue resolution processes and overall productivity in modern software development environments.

Created on 25 Jun. 2026

Assess the quality of the AI-generated content by voting

Score: 0

Similar papers summarized with our AI tools

66.8%

Seven Failure Points When Engineering a Retrieval Augmented Generation System

cs.SE

63.5%

Context Matters: Evaluating Context Strategies for Automated ADR Generation Usi…

cs.SE

54.2%

Exploring LLM-based Agents for Root Cause Analysis

cs.SE

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.