Picture that Sketch: Photorealistic Image Generation from Abstract Sketches

AI-generated keywords: Photorealistic images

AI-generated Key Points

  • Novel approach for generating photorealistic images from abstract sketches
  • Works with free-hand human sketches, accessible to amateurs
  • Decoupled encoder-decoder training paradigm using StyleGAN
  • Autoregressive sketch mapper bridges abstraction gap between sketch and photo
  • Specific designs including fine-grained discriminative loss and partial-aware sketch augmentation strategy
  • Downstream applications:
  • Fine-grained sketch-based image retrieval with superior performance compared to state-of-the-art methods
  • Precise semantic editing with consistent local changes in generated images
  • Fine-grained control over appearance features through multi-modal generation
  • Outperforms existing state-of-the-art approaches in terms of generating photorealistic images from abstract sketches
  • Detailed results and comparisons provided with metrics such as LPIPS and FID scores
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

Accepted in CVPR 2023. Project page available at https://subhadeepkoley.github.io/PictureThatSketch
License: CC BY-NC-SA 4.0

Abstract: Given an abstract, deformed, ordinary sketch from untrained amateurs like you and me, this paper turns it into a photorealistic image - just like those shown in Fig. 1(a), all non-cherry-picked. We differ significantly from prior art in that we do not dictate an edgemap-like sketch to start with, but aim to work with abstract free-hand human sketches. In doing so, we essentially democratise the sketch-to-photo pipeline, "picturing" a sketch regardless of how good you sketch. Our contribution at the outset is a decoupled encoder-decoder training paradigm, where the decoder is a StyleGAN trained on photos only. This importantly ensures that generated results are always photorealistic. The rest is then all centred around how best to deal with the abstraction gap between sketch and photo. For that, we propose an autoregressive sketch mapper trained on sketch-photo pairs that maps a sketch to the StyleGAN latent space. We further introduce specific designs to tackle the abstract nature of human sketches, including a fine-grained discriminative loss on the back of a trained sketch-photo retrieval model, and a partial-aware sketch augmentation strategy. Finally, we showcase a few downstream tasks our generation model enables, amongst them is showing how fine-grained sketch-based image retrieval, a well-studied problem in the sketch community, can be reduced to an image (generated) to image retrieval task, surpassing state-of-the-arts. We put forward generated results in the supplementary for everyone to scrutinise.

Submitted to arXiv on 20 Mar. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2303.11162v2

, , , , This paper presents a novel approach for generating photorealistic images from abstract sketches. Unlike previous methods that require specific edgemap-like sketches, this approach works with free-hand human sketches, making it accessible to amateurs. The key contribution of this work is a decoupled encoder-decoder training paradigm, where the decoder is a StyleGAN trained on photos only, ensuring that the generated results are always photorealistic. To bridge the abstraction gap between sketch and photo, the authors propose an autoregressive sketch mapper trained on sketch-photo pairs. This mapper maps a sketch to the latent space of the pre-trained StyleGAN to generate realistic images. To address the abstract nature of human sketches, the authors introduce specific designs including a fine-grained discriminative loss and a partial-aware sketch augmentation strategy. The paper also demonstrates several downstream applications enabled by their generation model. One application is fine-grained sketch-based image retrieval, where they convert the task into an image-based retrieval task and achieve superior performance compared to state-of-the-art methods. Another application is precise semantic editing, where users can modify specific regions of an input sketch and observe consistent local changes in the generated images. Additionally, the proposed method allows for fine-grained control over appearance features through multi-modal generation. By replacing medium or fine-level latent codes with random vectors, users can explore different color variations and appearance features in the generated images. Overall, extensive experiments show that this method outperforms existing state-of-the-art approaches in terms of generating photorealistic images from abstract sketches. The authors provide detailed results and comparisons with other methods in terms of metrics such as LPIPS and FID scores. In conclusion, this paper presents a novel approach for generating photorealistic images from abstract sketches without requiring specific edgemap-like inputs. The proposed method achieves superior performance compared to existing state-of-the-art approaches and enables various downstream applications such as fine-grained image retrieval and precise semantic editing.
Created on 09 Jan. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.