VTON-IT: Virtual Try-On using Image Translation

AI-generated keywords: Virtual Try-On

AI-generated Key Points

  • Virtual Try-On (VTON) is a promising application of Generative Adversarial Networks (GANs)
  • Challenges include transferring clothing items onto different body sizes, poses, and occlusions
  • VTON-IT (Virtual Try-On using Image Translation) utilizes semantic segmentation and generative adversarial architecture-based image translation network
  • Evaluation metrics used include Structural Similarity Index (SSIM), Multi-Scale Structural Similarity (MS-SSIM), Fréchet Inception Distance (FID), and Kernel Inspection Distance (KID) scores
  • VTON-IT outperformed existing approaches in producing high-resolution natural images with detailed textures on variant images
  • User study showed 70% similarity to ground truth images and 60% photo-realism
  • Challenges faced in training human body segmentation network due to improper annotations in existing datasets
  • Future work may involve expanding the application of VTON-IT to include different types of clothing items
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Santosh Adhikari, Bishnu Bhusal, Prashant Ghimire, Anil Shrestha

License: CC BY 4.0

Abstract: Virtual Try-On (trying clothes virtually) is a promising application of the Generative Adversarial Network (GAN). However, it is an arduous task to transfer the desired clothing item onto the corresponding regions of a human body because of varying body size, pose, and occlusions like hair and overlapped clothes. In this paper, we try to produce photo-realistic translated images through semantic segmentation and a generative adversarial architecture-based image translation network. We present a novel image-based Virtual Try-On application VTON-IT that takes an RGB image, segments desired body part, and overlays target cloth over the segmented body region. Most state-of-the-art GAN-based Virtual Try-On applications produce unaligned pixelated synthesis images on real-life test images. However, our approach generates high-resolution natural images with detailed textures on such variant images.

Submitted to arXiv on 06 Oct. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2310.04558v2

, , , , Virtual Try-On, also known as trying clothes virtually, is a promising application of Generative Adversarial Networks (GANs). However, it poses challenges in transferring clothing items onto different body sizes, poses, and occlusions such as hair and overlapped clothes. To address this issue, a novel approach called VTON-IT (Virtual Try-On using Image Translation) is introduced in this paper. It utilizes semantic segmentation and a generative adversarial architecture-based image translation network to produce photo-realistic translated images. The proposed method takes an RGB image, segments the desired body part, and overlays the target cloth over the segmented body region. To evaluate the effectiveness of VTON-IT, various quantitative metrics such as Structural Similarity Index (SSIM), Multi-Scale Structural Similarity (MS-SSIM), Fréchet Inception Distance (FID), and Kernel Inspection Distance (KID) scores were used to measure the similarity between ground truth images manually wrapped with clothes and synthesized images generated by the model. The results showed that VTON-IT outperformed existing approaches in producing high-resolution natural images with detailed textures on variant images. Additionally, a user study involving 60 volunteers was conducted to assess the realism and visual quality of the synthesized images. Volunteers were asked to score based on how real the clothes looked on the person and how well the texture of the clothing was preserved. The results indicated that VTON-IT achieved a 70% similarity to ground truth images and 60% photo-realism. The paper also discusses the challenges faced in training a human body segmentation network due to improper annotations in existing datasets. To address this issue, 6000 high-quality images from the FGC6 dataset were manually curated for training purposes. In conclusion, VTON-IT presents an innovative solution for Virtual Try-On applications by effectively transferring clothing onto human images while considering variations in body size, pose, and lighting conditions. The proposed architecture demonstrates superior performance in generating natural-looking synthesized images compared to existing methods. Future work may involve expanding the application of VTON-IT to include different types of clothing items such as dresses, shorts, shoes, and beyond.
Created on 27 Jul. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.