Developing a Llama-Based Chatbot for CI/CD Question Answering: A Case Study at Ericsson

AI-generated keywords: Llama-Based Chatbot CI/CD Question Answering Retrieval-Augmented Generation Model Industrial Setting AI-Driven Question Answering Systems

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors developed a llama-based chatbot for CI/CD question answering at Ericsson
  • Chatbot uses retrieval-augmented generation (RAG) model for accuracy and relevance
  • Ensemble retriever with BM25 and embedding retrievers showed superior performance
  • Chatbot provided fully correct responses for 61.11% of questions, partially correct answers for 26.39%, and incorrect answers for 12.50%
  • Error analysis conducted to identify causes of inaccuracies
  • Lessons learned during development process reflected upon
  • Future directions proposed to enhance chatbot accuracy
  • Research accepted for presentation at ICSME 2024
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Daksh Chaudhary, Sri Lakshmi Vadlamani, Dimple Thomas, Shiva Nejati, Mehrdad Sabetzadeh

This paper has been accepted at the 40th IEEE International Conference on Software Maintenance and Evolution (ICSME 2024)

Abstract: This paper presents our experience developing a Llama-based chatbot for question answering about continuous integration and continuous delivery (CI/CD) at Ericsson, a multinational telecommunications company. Our chatbot is designed to handle the specificities of CI/CD documents at Ericsson, employing a retrieval-augmented generation (RAG) model to enhance accuracy and relevance. Our empirical evaluation of the chatbot on industrial CI/CD-related questions indicates that an ensemble retriever, combining BM25 and embedding retrievers, yields the best performance. When evaluated against a ground truth of 72 CI/CD questions and answers at Ericsson, our most accurate chatbot configuration provides fully correct answers for 61.11% of the questions, partially correct answers for 26.39%, and incorrect answers for 12.50%. Through an error analysis of the partially correct and incorrect answers, we discuss the underlying causes of inaccuracies and provide insights for further refinement. We also reflect on lessons learned and suggest future directions for further improving our chatbot's accuracy.

Submitted to arXiv on 17 Aug. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2408.09277v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their paper titled "Developing a Llama-Based Chatbot for CI/CD Question Answering: A Case Study at Ericsson," authors Daksh Chaudhary, Sri Lakshmi Vadlamani, Dimple Thomas, Shiva Nejati, and Mehrdad Sabetzadeh present their experience in creating a chatbot tailored for continuous integration and continuous delivery (CI/CD) inquiries within the context of Ericsson, a prominent telecommunications company. The chatbot is specifically designed to navigate the intricacies of CI/CD documentation at Ericsson by utilizing a retrieval-augmented generation (RAG) model to enhance both accuracy and relevance. Through empirical evaluation on industrial CI/CD-related queries, the authors found that an ensemble retriever incorporating BM25 and embedding retrievers demonstrated superior performance. When compared against a set of 72 ground truth CI/CD questions and answers from Ericsson, the most accurate configuration of the chatbot provided fully correct responses for 61.11% of the questions, partially correct answers for 26.39%, and incorrect answers for 12.50%. The paper delves into an error analysis of the partially correct and incorrect responses to identify underlying causes of inaccuracies, offering valuable insights for further refinement. Additionally, the authors reflect on lessons learned during the development process and propose future directions aimed at enhancing the accuracy of their chatbot. This research has been accepted for presentation at the 40th IEEE International Conference on Software Maintenance and Evolution (ICSME 2024), showcasing its significance in advancing knowledge within the field of AI-driven question answering systems in industrial settings like Ericsson.
Created on 25 Aug. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.