Where to Diffuse, How to Diffuse, and How to Get Back: Automated Learning for Multivariate Diffusions
AI-generated Key Points
- Diffusion-based generative models (DBGMs) have been successful in tasks such as image generation, editing, translation, and conditional text-to-image tasks.
- DBGMs perturb data to a target noise distribution and then reverse the process to generate samples.
- The choice of the inference diffusion process greatly impacts both likelihoods and sample quality.
- The authors propose a recipe for maximizing a lower bound on the likelihood of Multivariate Diffusion Models (MDMs), for any number of auxiliary variables, without model-specific analysis.
- They demonstrate how to parameterize the diffusion for a specified target noise distribution, enabling optimization of the inference diffusion process.
- Optimizing the diffusion process expands easy experimentation from a few well-known processes to an automatic search over all linear diffusions.
- The authors introduce two new specific diffusions and also learn a diffusion process on the MNIST, CIFAR10, and ImageNet32 datasets.
- Learned MDMs match or surpass the bits-per-dim (BPD) of fixed choices of diffusions for a given dataset and model architecture.
- Selecting an appropriate inference process is essential for DBGMs' performance.
- This work enables rapid prototyping and evaluation of different multivariate diffusions in DBGMs by providing a method to optimize the inference diffusion process without extensive analysis.
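The bits-per-dim (BPD) metric mentioned in the key points is a rescaled negative log-likelihood. The sketch below shows the standard conversion from nats to bits per dimension; the function name `nats_to_bits_per_dim` is hypothetical and is not taken from the paper. When the input is a negative lower bound on the log-likelihood rather than the exact value, the result is an upper bound on the true BPD.

```python
import math


def nats_to_bits_per_dim(nll_nats: float, num_dims: int) -> float:
    """Convert a negative log-likelihood in nats to bits per dimension.

    Hypothetical helper for illustration: divide by the number of data
    dimensions, then by ln(2) to change the log base from e to 2.
    """
    return nll_nats / (num_dims * math.log(2.0))


# A CIFAR10 image has 3 * 32 * 32 = 3072 dimensions.
cifar10_dims = 3 * 32 * 32
example_bpd = nats_to_bits_per_dim(cifar10_dims * math.log(2.0), cifar10_dims)
```

Lower BPD is better, so "match or surpass" here means an equal or lower value than the fixed-diffusion baseline.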
Authors: Raghav Singhal, Mark Goldstein, Rajesh Ranganath
Abstract: Diffusion-based generative models (DBGMs) perturb data to a target noise distribution and reverse this process to generate samples. The choice of noising process, or inference diffusion process, affects both likelihoods and sample quality. For example, extending the inference process with auxiliary variables leads to improved sample quality. While there are many such multivariate diffusions to explore, each new one requires significant model-specific analysis, hindering rapid prototyping and evaluation. In this work, we study Multivariate Diffusion Models (MDMs). For any number of auxiliary variables, we provide a recipe for maximizing a lower-bound on the MDMs likelihood without requiring any model-specific analysis. We then demonstrate how to parameterize the diffusion for a specified target noise distribution; these two points together enable optimizing the inference diffusion process. Optimizing the diffusion expands easy experimentation from just a few well-known processes to an automatic search over all linear diffusions. To demonstrate these ideas, we introduce two new specific diffusions as well as learn a diffusion process on the MNIST, CIFAR10, and ImageNet32 datasets. We show learned MDMs match or surpass bits-per-dims (BPDs) relative to fixed choices of diffusions for a given dataset and model architecture.
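To make "perturb data to a target noise distribution" concrete, here is a minimal sketch of the forward (inference) process for one well-known linear diffusion: the variance-preserving SDE dx = -(β/2) x dt + √β dW with a constant rate β. This is a fixed, univariate-per-coordinate example with no auxiliary variables, not the learned multivariate diffusions studied in the paper; the function name `vp_perturb` and the constant `beta` are assumptions made for illustration. For this SDE the transition kernel is Gaussian, x_t | x_0 ~ N(x_0 e^{-βt/2}, (1 - e^{-βt}) I), so samples at any time t can be drawn in closed form.

```python
import numpy as np


def vp_perturb(x0, t, beta=1.0, rng=None):
    """Sample x_t given x_0 under a constant-rate variance-preserving SDE.

    Illustrative assumption: beta is constant in time, so the mean
    coefficient is exp(-beta * t / 2) and the variance is 1 - exp(-beta * t).
    """
    rng = np.random.default_rng() if rng is None else rng
    mean_coef = np.exp(-0.5 * beta * t)
    variance = 1.0 - np.exp(-beta * t)
    noise = rng.standard_normal(x0.shape)
    return mean_coef * x0 + np.sqrt(variance) * noise


rng = np.random.default_rng(0)
x0 = np.full(10_000, 3.0)                  # toy "data": every coordinate is 3
x_start = vp_perturb(x0, t=0.0, rng=rng)   # no noise has been added yet
x_late = vp_perturb(x0, t=50.0, rng=rng)   # close to the N(0, I) target
```

At t = 0 the sample equals the data, and for large t it is approximately standard normal, which is the target noise distribution for this particular process. Learning the inference process, as the abstract describes, would replace the fixed drift and diffusion coefficients with trainable ones.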