Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts

AI-generated keywords: Computer-Aided Design

AI-generated Key Points

  • Text2CAD is an AI framework that generates text-to-parametric CAD models using user-friendly instructions.
  • A data annotation pipeline uses Mistral and LLaVA-NeXT to generate natural-language text prompts for the models in the DeepCAD dataset.
  • An end-to-end transformer-based auto-regressive network is proposed within the Text2CAD framework for generating parametric CAD models from input texts.
  • Performance evaluation metrics include visual quality, parametric precision, and geometrical accuracy, showcasing the potential of the framework in AI-aided design applications.
  • Expert-level instructions (L3) are included in annotations for users requiring precise geometric descriptions and relative measurements for CAD modeling tasks.
  • The Text2CAD transformer architecture autonomously deduces all intermediate design steps to transform natural language descriptions into 3D CAD models.
  • Experimental analysis demonstrates superior performance compared to traditional two-stage baseline methods commonly used in similar tasks.

Authors: Mohammad Sadil Khan, Sankalp Sinha, Talha Uddin Sheikh, Didier Stricker, Sk Aziz Ali, Muhammad Zeshan Afzal

Accepted in NeurIPS 2024 (Spotlight)
License: CC BY 4.0

Abstract: Prototyping complex computer-aided design (CAD) models in modern software can be very time-consuming. This is due to the lack of intelligent systems that can quickly generate simpler intermediate parts. We propose Text2CAD, the first AI framework for generating text-to-parametric CAD models using designer-friendly instructions for all skill levels. Furthermore, we introduce a data annotation pipeline for generating text prompts based on natural language instructions for the DeepCAD dataset using Mistral and LLaVA-NeXT. The dataset contains $\sim170$K models and $\sim660$K text annotations, from abstract CAD descriptions (e.g., generate two concentric cylinders) to detailed specifications (e.g., draw two circles with center $(x,y)$ and radius $r_{1}$, $r_{2}$, and extrude along the normal by $d$...). Within the Text2CAD framework, we propose an end-to-end transformer-based auto-regressive network to generate parametric CAD models from input texts. We evaluate the performance of our model through a mixture of metrics, including visual quality, parametric precision, and geometrical accuracy. Our proposed framework shows great potential in AI-aided design applications. Our source code and annotations will be publicly available.
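
To make the "detailed specification" example in the abstract concrete, the following is a minimal sketch of how such a prompt could map to a parametric sketch-and-extrude sequence. The class names (Circle, Extrude) and helper functions are hypothetical and chosen for illustration only; they are not the paper's or DeepCAD's actual data format.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass(frozen=True)
class Circle:
    """A sketch circle (hypothetical type, not the DeepCAD schema)."""
    cx: float
    cy: float
    r: float


@dataclass(frozen=True)
class Extrude:
    """Extrude the current sketch along its normal by `distance`."""
    distance: float


Step = Union[Circle, Extrude]


def concentric_circles_extrusion(
    x: float, y: float, r1: float, r2: float, d: float
) -> List[Step]:
    """Build the sequence for: two circles at (x, y) with radii r1, r2,
    then an extrusion of depth d along the sketch normal."""
    if min(r1, r2) <= 0 or d <= 0:
        raise ValueError("radii and extrusion depth must be positive")
    return [Circle(x, y, r1), Circle(x, y, r2), Extrude(d)]


def describe(steps: List[Step]) -> str:
    """Render a sequence as a readable one-line string."""
    parts = []
    for step in steps:
        if isinstance(step, Circle):
            parts.append(f"circle center=({step.cx}, {step.cy}) r={step.r}")
        else:
            parts.append(f"extrude d={step.distance}")
    return "; ".join(parts)


if __name__ == "__main__":
    print(describe(concentric_circles_extrusion(0.0, 0.0, 1.0, 2.0, 0.5)))
```

The point of the sketch is that every quantity in the text prompt (center, radii, depth) becomes an explicit, editable parameter of the sequence, which is what distinguishes a parametric CAD model from a mesh or point cloud.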

Submitted to arXiv on 25 Sep. 2024

Results of the summarizing process for the arXiv paper: 2409.17106v1

In Computer-Aided Design (CAD), prototyping complex models can be time-consuming because intelligent systems for quickly generating simpler intermediate parts are lacking. To address this challenge, we present Text2CAD, an AI framework that generates parametric CAD models from text, using instructions suitable for designers of all skill levels.

Our framework introduces a data annotation pipeline that uses Mistral and LLaVA-NeXT to generate natural-language text prompts for the DeepCAD dataset. The resulting dataset includes approximately 170,000 models and 660,000 text annotations, ranging from abstract CAD descriptions to detailed specifications. The annotations include expert-level instructions (L3) for users who require precise geometric descriptions and relative measurements for their CAD modeling tasks. The multi-level instructions were generated over a span of 10 days, with the aim of ensuring accuracy and reducing the hallucinations often associated with minimal-metadata approaches.

Within the Text2CAD framework, we propose an end-to-end transformer-based auto-regressive network that generates parametric CAD models from input texts. The Text2CAD transformer is designed to transform natural language descriptions into 3D CAD models by deducing all intermediate design steps autonomously. The model is evaluated on visual quality, parametric precision, and geometrical accuracy. Our experimental analysis shows superior performance compared to two-stage baseline methods, and the results demonstrate the potential of the framework in AI-aided design applications.

In summary, this paper presents Text2CAD as an AI framework for generating parametric 3D CAD models from textual descriptions. We describe our data annotation pipeline, which leverages both Large Language Models (LLMs) and Vision-Language Models (VLMs), introduce an end-to-end transformer-based auto-regressive architecture for CAD model generation from text prompts, discuss related work in the CAD domain, present experimental results demonstrating the effectiveness of our approach, acknowledge limitations of our framework, and conclude with future research directions.
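
The auto-regressive generation described above can be illustrated with a generic greedy decoding loop. This is a sketch under stated assumptions, not the Text2CAD architecture: the token names, the START and END markers, and toy_next_token (a fixed script standing in for a trained transformer) are all hypothetical.

```python
from typing import Callable, List, Sequence

START, END = "<start>", "<end>"  # hypothetical sequence delimiters


def decode(
    next_token: Callable[[str, Sequence[str]], str],
    prompt: str,
    max_len: int = 32,
) -> List[str]:
    """Generate a CAD token sequence one token at a time.

    Each step conditions on the text prompt and on every token
    produced so far, which is what makes the loop auto-regressive.
    """
    tokens = [START]
    while len(tokens) < max_len:
        token = next_token(prompt, tokens)
        tokens.append(token)
        if token == END:
            break
    return tokens


def toy_next_token(prompt: str, prefix: Sequence[str]) -> str:
    """Stand-in for a trained model: replays a fixed script.

    A real network would score all candidate tokens given the prompt
    and prefix; this toy version ignores the prompt entirely.
    """
    script = ["circle", "circle", "extrude", END]
    index = len(prefix) - 1  # number of tokens emitted after START
    return script[index] if index < len(script) else END


if __name__ == "__main__":
    print(decode(toy_next_token, "generate two concentric cylinders"))
```

Because the loop only needs a callable that returns the next token, the toy function could be swapped for any model that predicts one design step at a time; the max_len cap guards against sequences that never emit the end marker.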
Created on 03 May. 2025
