Extracting Knowledge Graphs from User Stories using LangChain

AI-generated keywords: Knowledge graphs User stories Large Language Models (LLMs) LangChain framework Requirements engineering

AI-generated Key Points

  • Novel methodology for generating knowledge graphs from user stories using Large Language Models (LLMs)
  • Addressing limitations of user stories in capturing overall system perspective in software development
  • Proposal to extract structured data from user stories and model them into knowledge graphs for enhanced comprehension
  • Development of LangChain framework and User Story Graph Transformer module for automating knowledge graph extraction process
  • Importance of aligning software functionalities with user expectations for more effective and user-centric software development
  • Acknowledgment of key contributors to the research, including Prof. Dr. Leen Lambers, Dr. Kate Revoredo, Dr. Sébastien Mosser, and Prof. Dr. Douglas Cunningham
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Thayná Camargo da Silva

Master thesis work
License: CC BY-NC-SA 4.0

Abstract: This thesis introduces a novel methodology for the automated generation of knowledge graphs from user stories by leveraging the advanced capabilities of Large Language Models. Utilizing the LangChain framework as a basis, the User Story Graph Transformer module was developed to extract nodes and relationships from user stories using an LLM to construct accurate knowledge graphs.This innovative technique was implemented in a script to fully automate the knowledge graph extraction process. Additionally, the evaluation was automated through a dedicated evaluation script, utilizing an annotated dataset for assessment. By enhancing the visualization and understanding of user requirements and domain concepts, this method fosters better alignment between software functionalities and user expectations, ultimately contributing to more effective and user-centric software development processes.

Submitted to arXiv on 14 May. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2506.11020v1

This Master's thesis presents a novel methodology for automatically generating knowledge graphs from user stories using the advanced capabilities of Large Language Models (LLMs). The research addresses the limitations of user stories in capturing the overall system perspective in software development. User stories are natural language descriptions of software requirements widely used in agile methodologies but often lack structured information for comprehensive system understanding. To overcome these challenges, the thesis proposes extracting structured data from user stories and modeling them into knowledge graphs. These visual and structured representations enhance data storage, analysis, and system comprehension. The LangChain framework serves as the basis for developing the User Story Graph Transformer module, which utilizes an LLM to accurately extract nodes and relationships from user stories. This innovative technique automates the knowledge graph extraction process through a script, streamlining the visualization and understanding of user requirements and domain concepts. An evaluation script with an annotated dataset is used to automate the assessment of knowledge graph accuracy. By improving alignment between software functionalities and user expectations, this method contributes to more effective and user-centric software development processes. The thesis acknowledges key contributors to the research including Prof. Dr. Leen Lambers, Dr. Kate Revoredo, Dr. Sébastien Mosser, and Prof. Dr. Douglas Cunningham for their support throughout the academic journey. plays a crucial role in defining software from a user's perspective; thus highlighting the significance of extracting structured information from user stories for creating coherent and manageable systems. Overall, this research provides valuable insights into utilizing LLMs for knowledge graph generation from user stories to enhance software development processes and improve system comprehension through structured representations of requirements.
Created on 22 Mar. 2026

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.