Controllable Multi-domain Semantic Artwork Synthesis

AI-generated keywords: Artwork Synthesis GAN SSTAN Semantic Maps Multi-Domain

AI-generated Key Points

  • Authors present a novel framework for multi-domain synthesis of artwork from semantic layouts
  • They propose a dataset called ArtSem consisting of 40,000 images of artwork from four different domains along with their corresponding semantic label maps
  • Conditional GAN-based approach used to generate high-quality artwork from extracted semantic maps without paired training data
  • Introduction of domain-dependent variational encoders for high-quality multi-domain synthesis
  • Introduction of Spatially STyle-Adaptive Normalization (SSTAN) method to normalize both semantic and style jointly for better generation quality
  • Model learns joint representation of style and semantic information, allowing fine-grained control over synthesized artwork
  • Combination of proposed dataset and approach leads to higher quality user-controllable artwork compared to existing methods
  • Contributes to computational visual media field by providing framework for controllable multi-domain semantic artwork synthesis
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yuantian Huang, Satoshi Iizuka, Edgar Simo-Serra, Kazuhiro Fukui

15 pages, accepted by CVMJ, to appear
License: CC BY 4.0

Abstract: We present a novel framework for multi-domain synthesis of artwork from semantic layouts. One of the main limitations of this challenging task is the lack of publicly available segmentation datasets for art synthesis. To address this problem, we propose a dataset, which we call ArtSem, that contains 40,000 images of artwork from 4 different domains with their corresponding semantic label maps. We generate the dataset by first extracting semantic maps from landscape photography and then propose a conditional Generative Adversarial Network (GAN)-based approach to generate high-quality artwork from the semantic maps without necessitating paired training data. Furthermore, we propose an artwork synthesis model that uses domain-dependent variational encoders for high-quality multi-domain synthesis. The model is improved and complemented with a simple but effective normalization method, based on normalizing both the semantic and style jointly, which we call Spatially STyle-Adaptive Normalization (SSTAN). In contrast to previous methods that only take semantic layout as input, our model is able to learn a joint representation of both style and semantic information, which leads to better generation quality for synthesizing artistic images. Results indicate that our model learns to separate the domains in the latent space, and thus, by identifying the hyperplanes that separate the different domains, we can also perform fine-grained control of the synthesized artwork. By combining our proposed dataset and approach, we are able to generate user-controllable artwork that is of higher quality than existing

Submitted to arXiv on 19 Aug. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2308.10111v1

In this research article, the authors present a novel framework for multi-domain synthesis of artwork from semantic layouts. They address the main limitation of this task - the lack of publicly available segmentation datasets for art synthesis - by proposing a dataset called ArtSem. This dataset consists of 40,000 images of artwork from four different domains along with their corresponding semantic label maps. To generate the dataset, the authors first extract semantic maps from landscape photography and then propose a conditional Generative Adversarial Network (GAN)-based approach to generate high-quality artwork from these semantic maps without requiring paired training data. Additionally, they introduce an artwork synthesis model that utilizes domain-dependent variational encoders for high-quality multi-domain synthesis. The authors further enhance their model by introducing a normalization method called Spatially STyle-Adaptive Normalization (SSTAN). This method normalizes both the semantic and style jointly, leading to better generation quality when synthesizing artistic images. Unlike previous methods that only consider semantic layout as input, their model learns a joint representation of both style and semantic information. The results indicate that the authors' model successfully learns to separate the domains in the latent space. By identifying the hyperplanes that separate different domains, fine-grained control over synthesized artwork can be achieved. By combining their proposed dataset and approach, they are able to generate user-controllable artwork of higher quality than existing methods. Overall, this research contributes to the field of computational visual media by providing a framework for controllable multi-domain semantic artwork synthesis. The proposed dataset and approach offer new possibilities for generating high-quality artistic images based on semantic layouts while considering both style and semantic information.
Created on 17 Nov. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.