Requirements Engineering using Generative AI: Prompts and Prompting Patterns
AI-generated Key Points
- Study focused on automating Requirements Engineering (RE) tasks using GenAI
- Utilized GPT-3.5 turbo API for automating requirement classification and tracing tasks
- Employed five different prompt patterns for effective task performance
- Dataset of System Requirements Specification (SRS) documents used for ground truth in requirement classification
- Binary Classification task aimed to distinguish Functional Requirements (FR) from Non-Functional Requirements (NFR)
- Requirements Traceability task involved identifying related requirements in SRS documents
- Performance evaluation metrics included precision, recall, accuracy, and F-Score to assess prompt patterns effectiveness
- Recommendations provided on selecting appropriate prompt patterns for specific RE tasks
- Evaluation framework offered for researchers and practitioners in the field
- Manual formatting of SRS files and removal of unnecessary information conducted for consistency in evaluation process
- Information Retrieval measures like precision, recall, and F-measure used to evaluate tool effectiveness in RE field
Authors: Krishna Ronanki, Beatriz Cabrero-Daniel, Jennifer Horkoff, Christian Berger
Abstract: [Context]: Companies are increasingly recognizing the importance of automating Requirements Engineering (RE) tasks due to their resource-intensive nature. The advent of GenAI has made these tasks more amenable to automation, thanks to its ability to understand and interpret context effectively. [Problem]: However, in the context of GenAI, prompt engineering is a critical factor for success. Despite this, we currently lack tools and methods to systematically assess and determine the most effective prompt patterns to employ for a particular RE task. [Method]: Two tasks related to requirements, specifically requirement classification and tracing, were automated using the GPT-3.5 turbo API. The performance evaluation involved assessing various prompts created using 5 prompt patterns and implemented programmatically to perform the selected RE tasks, focusing on metrics such as precision, recall, accuracy, and F-Score. [Results]: This paper evaluates the effectiveness of the 5 prompt patterns' ability to make GPT-3.5 turbo perform the selected RE tasks and offers recommendations on which prompt pattern to use for a specific RE task. Additionally, it also provides an evaluation framework as a reference for researchers and practitioners who want to evaluate different prompt patterns for different RE tasks.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
Look for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.