Text2Mesh: Text-Driven Neural Stylization for Meshes

AI-generated keywords: Text2Mesh, 3D Objects, Neural Style Field Network, CLIP, Semantic Manipulation

AI-generated Key Points

  • Text2Mesh is a framework for controlling and editing the style of 3D objects
  • It uses a disentangled representation of a 3D object: a fixed mesh input (content) paired with a learned neural network, the neural style field network (style)
  • The framework can handle low-quality meshes of arbitrary genus, and requires neither UV parameterization nor a specialized 3D mesh dataset
  • It can generate stylizations over different types of 3D meshes, including outerwear variations, muscle definition, and hair details
  • Text prompts are used as an easily modifiable and expressive means to control style
  • The technique extends beyond text-driven stylization to other target modalities, such as images and 3D meshes, using CLIP (Contrastive Language-Image Pre-training)
  • Text2Mesh enables semantic manipulation of style for 3D shapes while preserving global semantics and underlying content
  • Experiments were conducted to evaluate the method's performance, comparing it to baseline methods through user studies
  • The paper discusses the limitations of the approach

Authors: Oscar Michel, Roi Bar-On, Richard Liu, Sagie Benaim, Rana Hanocka

Project page: https://threedle.github.io/text2mesh/
License: CC BY 4.0

Abstract: In this work, we develop intuitive controls for editing the style of 3D objects. Our framework, Text2Mesh, stylizes a 3D mesh by predicting color and local geometric details which conform to a target text prompt. We consider a disentangled representation of a 3D object using a fixed mesh input (content) coupled with a learned neural network, which we term neural style field network. In order to modify style, we obtain a similarity score between a text prompt (describing style) and a stylized mesh by harnessing the representational power of CLIP. Text2Mesh requires neither a pre-trained generative model nor a specialized 3D mesh dataset. It can handle low-quality meshes (non-manifold, boundaries, etc.) with arbitrary genus, and does not require UV parameterization. We demonstrate the ability of our technique to synthesize a myriad of styles over a wide variety of 3D meshes.
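The split between fixed content and learned style described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the tiny two-layer network, the function names `init_style_field` and `apply_style_field`, the `max_disp` bound, and the choice to move each vertex along its normal are all assumptions made for illustration. The weights are random rather than trained; the point is only to show the data flow from mesh vertices to per-vertex color and local geometric detail.

```python
import numpy as np

def init_style_field(rng, hidden=32):
    """Random weights for a tiny MLP (hypothetical stand-in for a style field)."""
    return {
        "w1": rng.normal(0.0, 0.5, size=(3, hidden)),
        "b1": np.zeros(hidden),
        "w_color": rng.normal(0.0, 0.5, size=(hidden, 3)),
        "w_disp": rng.normal(0.0, 0.5, size=(hidden, 1)),
    }

def apply_style_field(params, vertices, normals, max_disp=0.1):
    """Map each vertex to an RGB color and a bounded offset along its normal.

    The mesh connectivity (faces) is never touched, so the content stays fixed.
    """
    h = np.tanh(vertices @ params["w1"] + params["b1"])
    # Sigmoid keeps every color channel in [0, 1].
    colors = 1.0 / (1.0 + np.exp(-(h @ params["w_color"])))
    # tanh keeps the scalar displacement in [-max_disp, max_disp].
    disp = max_disp * np.tanh(h @ params["w_disp"])
    styled_vertices = vertices + disp * normals
    return styled_vertices, colors

# Toy content: four points on the unit sphere, so normals equal positions.
vertices = np.array([[1.0, 0.0, 0.0],
                     [0.0, 1.0, 0.0],
                     [0.0, 0.0, 1.0],
                     [-1.0, 0.0, 0.0]])
normals = vertices / np.linalg.norm(vertices, axis=1, keepdims=True)

params = init_style_field(np.random.default_rng(0))
styled_vertices, colors = apply_style_field(params, vertices, normals)
```

Bounding the displacement is one plausible way to keep geometric changes local, so that the overall shape of the input mesh is preserved while surface detail varies.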

Submitted to arXiv on 06 Dec. 2021

Results of the summarizing process for the arXiv paper: 2112.03221v1

In this work, the authors present Text2Mesh, a framework for intuitive control and editing of the style of 3D objects. The framework uses a disentangled representation of a 3D object: a fixed mesh input (the content) coupled with a neural network called the neural style field network (the style). This network predicts color and local geometric details that conform to a target text prompt, which is how a 3D mesh is stylized.

One notable feature of Text2Mesh is its ability to handle low-quality meshes with arbitrary genus without requiring UV parameterization or a specialized 3D mesh dataset. The framework can generate stylizations over a wide variety of 3D meshes, synthesizing styles such as outerwear variations, muscle definition, and hair details.

The authors emphasize text prompts as an easily modifiable and expressive means of controlling style. They demonstrate that the technique extends beyond text-driven stylization to other target modalities, such as images and 3D meshes, by harnessing the representational power of CLIP (Contrastive Language-Image Pre-training). The paper also discusses related work in text-driven manipulation and highlights how the method differs from existing techniques. Examples showcase the fine-grained control the approach offers, generating high-fidelity details while preserving global semantics and underlying content.

The authors conduct experiments to evaluate the method, explore aspects such as learning color and geometry together, and compare it with baseline methods through user studies. They also discuss the limitations of the approach.

Overall, Text2Mesh presents a technique for semantic manipulation of style in 3D meshes using text prompts and a CLIP-guided neural network. It offers intuitive controls for generating diverse stylizations across various types of 3D objects while maintaining high fidelity and handling low-quality meshes.
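The similarity score between a text prompt and a stylized mesh can be sketched as a cosine similarity between embedding vectors. The sketch below is a simplified illustration under stated assumptions, not the paper's code: the short hand-written vectors stand in for CLIP embeddings of the prompt and of rendered views of the mesh, the function names are hypothetical, and averaging the view embeddings before comparison is an assumed design choice.

```python
import numpy as np

def cosine_similarity(a, b):
    """Cosine of the angle between two embedding vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def style_loss(view_embeddings, text_embedding):
    """Negative similarity between the mean view embedding and the prompt.

    Minimizing this value pushes the rendered views toward the text prompt.
    """
    mean_embedding = view_embeddings.mean(axis=0)
    return -cosine_similarity(mean_embedding, text_embedding)

# Stand-in embeddings (assumptions; real CLIP embeddings have hundreds of dimensions).
text_embedding = np.array([1.0, 0.0, 0.0])
matching_views = np.array([[0.9, 0.1, 0.0],
                           [0.8, 0.0, 0.2]])
unrelated_views = np.array([[0.0, 1.0, 0.0],
                            [0.0, 0.9, 0.1]])

loss_matching = style_loss(matching_views, text_embedding)
loss_unrelated = style_loss(unrelated_views, text_embedding)
```

Views whose embeddings point in nearly the same direction as the prompt embedding yield a lower loss than unrelated views, which is the signal an optimizer would use to update the style network while the mesh content stays fixed.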
Created on 11 Dec. 2023

