ChaTA: Towards an Intelligent Question-Answer Teaching Assistant using Open-Source LLMs

AI-generated keywords: Scalable question-answering

AI-generated Key Points

  • Study addresses challenges of scalable and intelligent question-answering (QA)
  • Leveraging open-source Large Language Models (LLMs)
  • Pipeline combines retrieval augmented generation (RAG), supervised fine-tuning (SFT), and an alternative to reinforcement learning with human feedback (RLHF)
  • Enhancing LLMs from the LLaMA-2 family
  • Experiments conducted on a Piazza dataset from an introductory CS course
  • Dataset consists of 10k QA pairs and 1.5k pairs of preferences data
  • Data privacy ensured
  • Utilizing adaptability of LLMs to offer versatile query responses
  • Comprehensive evaluation shows pipeline improves answer quality by 33%
  • RAG particularly impactful
  • Work lays foundation for ChaTA, an intelligent QA assistant customizable for courses with online QA platform
  • Effective fine-tuning of LMs on instruction data and human preferences data to improve task completion and response quality highlighted in related work
  • Challenges and future directions in utilizing machine learning for QA workflows discussed
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yann Hicke, Anmol Agarwal, Qianou Ma, Paul Denny

License: CC BY 4.0

Abstract: To address the challenges of scalable and intelligent question-answering (QA), we introduce an innovative solution that leverages open-source Large Language Models (LLMs) to ensure data privacy. We use models from the LLaMA-2 family and augmentations including retrieval augmented generation (RAG), supervised fine-tuning (SFT), and an alternative to reinforcement learning with human feedback (RLHF). We perform our experiments on a Piazza dataset from an introductory CS course with 10k QA pairs and 1.5k pairs of preferences data and conduct both human evaluations and automatic LLM evaluations on a small subset. We find preliminary evidence that modeling techniques collectively enhance the quality of answers by 33%, and RAG is an impactful addition. This work paves the way for the development of ChaTA, an intelligent QA assistant customizable for courses with an online QA platform.

Submitted to arXiv on 05 Nov. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2311.02775v1

In this study, we address the challenges of scalable and intelligent question-answering (QA) by leveraging open-source Large Language Models (LLMs). Our pipeline combines retrieval augmented generation (RAG), supervised fine-tuning (SFT), and an alternative to reinforcement learning with human feedback (RLHF) to enhance LLMs from the LLaMA-2 family. We conduct experiments on a Piazza dataset from an introductory CS course, consisting of 10k QA pairs and 1.5k pairs of preferences data, while ensuring data privacy. To overcome limitations, we utilize the adaptability of LLMs to offer versatile query responses. Our comprehensive evaluation using both LLM-based and rubric-based human evaluations shows that our pipeline improves answer quality by 33%, with RAG being particularly impactful. This work lays the foundation for ChaTA, an intelligent QA assistant customizable for courses with an online QA platform. In related work, we highlight the effectiveness of fine-tuning LMs on instruction data and human preferences data to improve task completion and response quality. We also discuss challenges and future directions in utilizing machine learning for QA workflows.
Created on 01 Feb. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.