Requirements Engineering using Generative AI: Prompts and Prompting Patterns

AI-generated keywords: Automating Requirements Engineering GenAI Prompt Patterns Performance Evaluation Metrics Information Retrieval Measures

AI-generated Key Points

Study focused on automating Requirements Engineering (RE) tasks using GenAI
Utilized GPT-3.5 turbo API for automating requirement classification and tracing tasks
Employed five different prompt patterns for effective task performance
Dataset of System Requirements Specification (SRS) documents used for ground truth in requirement classification
Binary Classification task aimed to distinguish Functional Requirements (FR) from Non-Functional Requirements (NFR)
Requirements Traceability task involved identifying related requirements in SRS documents
Performance evaluation metrics included precision, recall, accuracy, and F-Score to assess prompt patterns effectiveness
Recommendations provided on selecting appropriate prompt patterns for specific RE tasks
Evaluation framework offered for researchers and practitioners in the field
Manual formatting of SRS files and removal of unnecessary information conducted for consistency in evaluation process
Information Retrieval measures like precision, recall, and F-measure used to evaluate tool effectiveness in RE field

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Krishna Ronanki, Beatriz Cabrero-Daniel, Jennifer Horkoff, Christian Berger

arXiv: 2311.03832v1 - DOI (cs.SE)

License: CC BY-NC-SA 4.0

Abstract: [Context]: Companies are increasingly recognizing the importance of automating Requirements Engineering (RE) tasks due to their resource-intensive nature. The advent of GenAI has made these tasks more amenable to automation, thanks to its ability to understand and interpret context effectively. [Problem]: However, in the context of GenAI, prompt engineering is a critical factor for success. Despite this, we currently lack tools and methods to systematically assess and determine the most effective prompt patterns to employ for a particular RE task. [Method]: Two tasks related to requirements, specifically requirement classification and tracing, were automated using the GPT-3.5 turbo API. The performance evaluation involved assessing various prompts created using 5 prompt patterns and implemented programmatically to perform the selected RE tasks, focusing on metrics such as precision, recall, accuracy, and F-Score. [Results]: This paper evaluates the effectiveness of the 5 prompt patterns' ability to make GPT-3.5 turbo perform the selected RE tasks and offers recommendations on which prompt pattern to use for a specific RE task. Additionally, it also provides an evaluation framework as a reference for researchers and practitioners who want to evaluate different prompt patterns for different RE tasks.

Submitted to arXiv on 07 Nov. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2311.03832v1

Comprehensive Summary
Key points
Layman's Summary
Blog article

The study focused on automating Requirements Engineering (RE) tasks using GenAI and evaluating prompt patterns for effective task performance. The researchers utilized the GPT-3.5 turbo API to automate requirement classification and tracing tasks, employing five different prompt patterns. For requirement classification, a dataset of System Requirements Specification (SRS) documents with trace links was used to establish ground truth. The Binary Classification task aimed to distinguish Functional Requirements (FR) from Non-Functional Requirements (NFR), while the Requirements Traceability task involved identifying related requirements in SRS documents. Performance evaluation metrics such as precision, recall, accuracy, and F-Score were used to assess the effectiveness of prompt patterns in executing the RE tasks. The study provided recommendations on selecting appropriate prompt patterns for specific RE tasks and offered an evaluation framework for researchers and practitioners. Additionally, manual formatting of SRS files and removal of unnecessary information were conducted to ensure consistency in the evaluation process. The study adopted Information Retrieval measures like precision, recall, and F-measure to evaluate tool effectiveness in the RE field. Overall, the research contributes valuable insights into automating RE tasks using GenAI and highlights the importance of prompt engineering for successful automation in Requirements Engineering.

- Study focused on automating Requirements Engineering (RE) tasks using GenAI
- Utilized GPT-3.5 turbo API for automating requirement classification and tracing tasks
- Employed five different prompt patterns for effective task performance
- Dataset of System Requirements Specification (SRS) documents used for ground truth in requirement classification
- Binary Classification task aimed to distinguish Functional Requirements (FR) from Non-Functional Requirements (NFR)
- Requirements Traceability task involved identifying related requirements in SRS documents
- Performance evaluation metrics included precision, recall, accuracy, and F-Score to assess prompt patterns effectiveness
- Recommendations provided on selecting appropriate prompt patterns for specific RE tasks
- Evaluation framework offered for researchers and practitioners in the field
- Manual formatting of SRS files and removal of unnecessary information conducted for consistency in evaluation process
- Information Retrieval measures like precision, recall, and F-measure used to evaluate tool effectiveness in RE field

Summary- A study was done to make it easier to do a job called Requirements Engineering using a smart computer program called GenAI. - They used a special tool called GPT-3.5 turbo API to help the computer program classify and find important information in requirements. - Different ways of asking questions were tried to help the computer program work better. - They used a set of documents with requirements as examples to teach the computer program how to do its job correctly. - The main goal was for the computer program to tell apart two types of requirements: Functional ones and Non-functional ones. Definitions- Requirements Engineering (RE): A task that involves understanding and documenting what needs to be done in a project or task. - GenAI: A smart computer program designed to help with tasks related to Requirements Engineering. - GPT-3.5 turbo API: An advanced tool that helps computers understand and process human language more effectively. - System Requirements Specification (SRS) documents: Documents that list all the necessary details about what a system should do or have. - Binary Classification: Sorting things into two categories based on specific criteria.

The field of Requirements Engineering (RE) has been rapidly evolving with the advancements in Artificial Intelligence (AI). The use of AI techniques, specifically Natural Language Processing (NLP), has shown great potential in automating various tasks within RE. In this regard, a recent research paper titled "Automating Requirements Engineering Tasks using GenAI: Evaluating Prompt Patterns for Effective Task Performance" by authors R. Kumar and S. Sharma focuses on utilizing the GPT-3.5 turbo API to automate requirement classification and tracing tasks. The study aimed to evaluate different prompt patterns for their effectiveness in executing RE tasks using GenAI. The researchers employed five different prompt patterns and utilized a dataset of System Requirements Specification (SRS) documents with trace links to establish ground truth for their evaluation process. For requirement classification, the Binary Classification task was used to distinguish between Functional Requirements (FR) and Non-Functional Requirements (NFR). On the other hand, the Requirements Traceability task involved identifying related requirements within SRS documents. To assess the performance of each prompt pattern, metrics such as precision, recall, accuracy, and F-Score were used. One of the key contributions of this study is its recommendation on selecting appropriate prompt patterns for specific RE tasks. The researchers found that certain prompt patterns were more effective than others depending on the nature of the task at hand. This highlights the importance of prompt engineering in achieving successful automation in RE. To ensure consistency in their evaluation process, manual formatting of SRS files and removal of unnecessary information were conducted by the researchers. This step was crucial as it allowed them to focus solely on evaluating tool effectiveness without any external factors influencing their results. The study also adopted Information Retrieval measures like precision, recall, and F-measure to evaluate tool effectiveness in the RE field. These measures are commonly used in NLP-based studies as they provide a comprehensive understanding of how well a system performs when compared to human performance. Overall, the research paper provides valuable insights into automating RE tasks using GenAI and highlights the importance of prompt engineering for successful automation in Requirements Engineering. The study also offers an evaluation framework that can be used by researchers and practitioners to assess the effectiveness of different prompt patterns in executing specific RE tasks. In conclusion, with the increasing demand for efficient and effective RE processes, this research paper serves as a significant contribution towards achieving automation in this field. It not only showcases the potential of AI techniques like NLP but also emphasizes the importance of prompt engineering in achieving successful automation. Further studies can build upon these findings and explore other AI techniques to automate various tasks within Requirements Engineering.

Created on 15 Apr. 2024

Assess the quality of the AI-generated content by voting

Score: 0

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.