DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

AI-generated keywords: DreamCraft3D 3D content generation 2D reference image Bootstrapped Score Distillation view-consistent guidance

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • DreamCraft3D is a hierarchical 3D content generation method
  • It produces high-fidelity and coherent 3D objects
  • It addresses the consistency issue faced by existing methods
  • It uses a 2D reference image to guide geometry sculpting and texture boosting
  • DreamCraft3D employs score distillation sampling through a view-dependent diffusion model for coherently rendered geometries
  • The authors propose Bootstrapped Score Distillation to boost texture fidelity
  • They train a personalized diffusion model called Dreambooth on augmented renderings of the scene
  • The optimization process involves alternating optimization of the diffusion prior and 3D scene representation
  • The optimized 3D scene aids in training the scene-specific diffusion model for view-consistent guidance in 3D optimization.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Jingxiang Sun, Bo Zhang, Ruizhi Shao, Lizhen Wang, Wen Liu, Zhenda Xie, Yebin Liu

Project Page: https://mrtornado24.github.io/DreamCraft3D/

Abstract: We present DreamCraft3D, a hierarchical 3D content generation method that produces high-fidelity and coherent 3D objects. We tackle the problem by leveraging a 2D reference image to guide the stages of geometry sculpting and texture boosting. A central focus of this work is to address the consistency issue that existing works encounter. To sculpt geometries that render coherently, we perform score distillation sampling via a view-dependent diffusion model. This 3D prior, alongside several training strategies, prioritizes the geometry consistency but compromises the texture fidelity. We further propose Bootstrapped Score Distillation to specifically boost the texture. We train a personalized diffusion model, Dreambooth, on the augmented renderings of the scene, imbuing it with 3D knowledge of the scene being optimized. The score distillation from this 3D-aware diffusion prior provides view-consistent guidance for the scene. Notably, through an alternating optimization of the diffusion prior and 3D scene representation, we achieve mutually reinforcing improvements: the optimized 3D scene aids in training the scene-specific diffusion model, which offers increasingly view-consistent guidance for 3D optimization. The optimization is thus bootstrapped and leads to substantial texture boosting. With tailored 3D priors throughout the hierarchical generation, DreamCraft3D generates coherent 3D objects with photorealistic renderings, advancing the state-of-the-art in 3D content generation. Code available at https://github.com/deepseek-ai/DreamCraft3D.

Submitted to arXiv on 25 Oct. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2310.16818v2

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

DreamCraft3D is a novel hierarchical 3D content generation method that produces high-fidelity and coherent 3D objects. It addresses the consistency issue faced by existing methods by leveraging a 2D reference image to guide the stages of geometry sculpting and texture boosting. To ensure coherently rendered geometries, DreamCraft3D employs score distillation sampling through a view-dependent diffusion model. This 3D prior combined with various training strategies prioritizes geometry consistency but compromises texture fidelity. To overcome this limitation, the authors propose Bootstrapped Score Distillation which specifically focuses on boosting texture. The authors train a personalized diffusion model called Dreambooth on augmented renderings of the scene to imbue it with 3D knowledge of the optimized scene and obtain view-consistent guidance for generating the final output. The optimization process involves an alternating optimization of the diffusion prior and 3D scene representation resulting in mutually reinforcing improvements. The optimized 3D scene aids in training the scene-specific diffusion model which offers increasingly view-consistent guidance for 3D optimization.
Created on 31 Oct. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.