Requirements Engineering using Generative AI: Prompts and Prompting Patterns

AI-generated keywords: Automating Requirements Engineering GenAI Prompt Patterns Performance Evaluation Metrics Information Retrieval Measures

AI-generated Key Points

  • Study focused on automating Requirements Engineering (RE) tasks using GenAI
  • Utilized GPT-3.5 turbo API for automating requirement classification and tracing tasks
  • Employed five different prompt patterns for effective task performance
  • Dataset of System Requirements Specification (SRS) documents used for ground truth in requirement classification
  • Binary Classification task aimed to distinguish Functional Requirements (FR) from Non-Functional Requirements (NFR)
  • Requirements Traceability task involved identifying related requirements in SRS documents
  • Performance evaluation metrics included precision, recall, accuracy, and F-Score to assess prompt patterns effectiveness
  • Recommendations provided on selecting appropriate prompt patterns for specific RE tasks
  • Evaluation framework offered for researchers and practitioners in the field
  • Manual formatting of SRS files and removal of unnecessary information conducted for consistency in evaluation process
  • Information Retrieval measures like precision, recall, and F-measure used to evaluate tool effectiveness in RE field
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Krishna Ronanki, Beatriz Cabrero-Daniel, Jennifer Horkoff, Christian Berger

License: CC BY-NC-SA 4.0

Abstract: [Context]: Companies are increasingly recognizing the importance of automating Requirements Engineering (RE) tasks due to their resource-intensive nature. The advent of GenAI has made these tasks more amenable to automation, thanks to its ability to understand and interpret context effectively. [Problem]: However, in the context of GenAI, prompt engineering is a critical factor for success. Despite this, we currently lack tools and methods to systematically assess and determine the most effective prompt patterns to employ for a particular RE task. [Method]: Two tasks related to requirements, specifically requirement classification and tracing, were automated using the GPT-3.5 turbo API. The performance evaluation involved assessing various prompts created using 5 prompt patterns and implemented programmatically to perform the selected RE tasks, focusing on metrics such as precision, recall, accuracy, and F-Score. [Results]: This paper evaluates the effectiveness of the 5 prompt patterns' ability to make GPT-3.5 turbo perform the selected RE tasks and offers recommendations on which prompt pattern to use for a specific RE task. Additionally, it also provides an evaluation framework as a reference for researchers and practitioners who want to evaluate different prompt patterns for different RE tasks.

Submitted to arXiv on 07 Nov. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2311.03832v1

The study focused on automating Requirements Engineering (RE) tasks using GenAI and evaluating prompt patterns for effective task performance. The researchers utilized the GPT-3.5 turbo API to automate requirement classification and tracing tasks, employing five different prompt patterns. For requirement classification, a dataset of System Requirements Specification (SRS) documents with trace links was used to establish ground truth. The Binary Classification task aimed to distinguish Functional Requirements (FR) from Non-Functional Requirements (NFR), while the Requirements Traceability task involved identifying related requirements in SRS documents. Performance evaluation metrics such as precision, recall, accuracy, and F-Score were used to assess the effectiveness of prompt patterns in executing the RE tasks. The study provided recommendations on selecting appropriate prompt patterns for specific RE tasks and offered an evaluation framework for researchers and practitioners. Additionally, manual formatting of SRS files and removal of unnecessary information were conducted to ensure consistency in the evaluation process. The study adopted Information Retrieval measures like precision, recall, and F-measure to evaluate tool effectiveness in the RE field. Overall, the research contributes valuable insights into automating RE tasks using GenAI and highlights the importance of prompt engineering for successful automation in Requirements Engineering.
Created on 15 Apr. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.