AutoBench: Automatic Testbench Generation and Evaluation Using LLMs for HDL Design

AI-generated keywords: Digital circuit design Testbenches Large Language Models (LLMs) AutoBench tool Automated testbench evaluation framework

AI-generated Key Points

  • Testbenches are crucial in verifying hardware through simulation in digital circuit design.
  • Traditional methods of generating testbenches are manual and time-consuming.
  • Researchers have developed the AutoBench tool, the first LLM-based testbench generator for digital circuit design.
  • AutoBench can automatically generate comprehensive testbenches by providing a description of the Design Under Test (DUT).
  • The tool uses a hybrid structure and self-checking system with LLMs.
  • An automated evaluation framework was developed to assess the quality of generated testbenches.
  • Experimental results show that AutoBench outperformed the baseline approach, achieving a 57% improvement in pass@1 ratio compared to direct LLM-generated testbenches.
  • AutoBench demonstrated 3.36 times higher pass@1 ratio for sequential circuits.
  • ICARUS Verilog was used as the simulator, and Python scripts were executed on Python 3.8.10 64-bit platform gpt-4-turbo-2024-04-09.
  • Evaluation metrics Eval0, Eval1, and Eval2 were employed from AutoEval criteria using a dataset of 156 Verilog problems derived from HDLBits with RTL mutant codes incorporated.
  • AutoBench represents a significant advancement in automating testbench generation for digital circuit design by leveraging LLMs effectively.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Ruidi Qiu, Grace Li Zhang, Rolf Drechsler, Ulf Schlichtmann, Bing Li

License: CC BY-NC-SA 4.0

Abstract: In digital circuit design, testbenches constitute the cornerstone of simulation-based hardware verification. Traditional methodologies for testbench generation during simulation-based hardware verification still remain partially manual, resulting in inefficiencies in testing various scenarios and requiring expensive time from designers. Large Language Models (LLMs) have demonstrated their potential in automating the circuit design flow. However, directly applying LLMs to generate testbenches suffers from a low pass rate. To address this challenge, we introduce AutoBench, the first LLM-based testbench generator for digital circuit design, which requires only the description of the design under test (DUT) to automatically generate comprehensive testbenches. In AutoBench, a hybrid testbench structure and a self-checking system are realized using LLMs. To validate the generated testbenches, we also introduce an automated testbench evaluation framework to evaluate the quality of generated testbenches from multiple perspectives. Experimental results demonstrate that AutoBench achieves a 57% improvement in the testbench pass@1 ratio compared with the baseline that directly generates testbenches using LLMs. For 75 sequential circuits, AutoBench successfully has a 3.36 times testbench pass@1 ratio compared with the baseline. The source codes and experimental results are open-sourced at this link: https://github.com/AutoBench/AutoBench

Submitted to arXiv on 04 Jul. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2407.03891v1

In the realm of digital circuit design, testbenches play a crucial role in verifying hardware through simulation. However, traditional methods of generating testbenches are often manual and time-consuming. To address this challenge, researchers have turned to Large Language Models (LLMs) to automate the process. The AutoBench tool is the first LLM-based testbench generator for digital circuit design. It can automatically generate comprehensive testbenches by simply providing a description of the Design Under Test (DUT). The tool incorporates a hybrid structure and self-checking system using LLMs. To assess the quality of generated testbenches, an automated evaluation framework was developed. Experimental results showed that AutoBench significantly outperformed the baseline approach in terms of pass rates. It achieved a 57% improvement in pass@1 ratio compared to direct LLM-generated testbenches and demonstrated 3.36 times higher pass@1 ratio for sequential circuits. The study utilized ICARUS Verilog as the simulator and Python scripts for execution on Python 3.8.10 64-bit platform gpt-4-turbo-2024-04-09. Evaluation metrics such as Eval0, Eval1, and Eval2 were employed from AutoEval criteria using a dataset of 156 Verilog problems derived from HDLBits with RTL mutant codes incorporated. In conclusion, AutoBench represents a significant advancement in automating testbench generation for digital circuit design by effectively leveraging LLMs. The open-sourced source codes and experimental results provide valuable insights into improving hardware verification processes within the industry.
Created on 01 Apr. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.