ReCoNet: Real-time Coherent Video Style Transfer Network

AI-generated keywords: Real-time Video Style Transfer ReCoNet Luminance Warping Constraint Feature-map-level Temporal Loss Perceptual Style Quality

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Existing image style transfer models based on convolutional neural networks have limitations when applied to videos
  • These models often suffer from high temporal inconsistency, meaning that the style transfer is not consistent across consecutive frames in a video
  • Some video style transfer models have been proposed to improve temporal consistency, but they sacrifice processing speed or perceptual style quality
  • The authors propose a novel real-time video style transfer model called ReCoNet
  • ReCoNet aims to generate temporally coherent style transfer videos while maintaining favorable perceptual styles
  • Two key innovations are introduced: luminance warping constraint and feature-map-level temporal loss
  • The luminance warping constraint captures luminance changes between consecutive frames and increases stylization stability under illumination effects
  • The feature-map-level temporal loss enhances temporal consistency on traceable objects in videos by ensuring consistency of distinct features throughout the video sequence
  • Experimental results demonstrate that ReCoNet achieves outstanding performance both qualitatively and quantitatively
  • ReCoNet provides fast processing speed, nice perceptual style quality, and high temporal consistency simultaneously
  • ReCoNet represents an important advancement in real-time video style transfer and can be used for various applications such as video editing and artistic expression.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Chang Gao, Derun Gu, Fangjun Zhang, Yizhou Yu

16 pages, 7 figures. For supplementary material, see https://www.dropbox.com/s/go6f7uopjjsala7/ReCoNet%20Supplementary%20Material.pdf?dl=0

Abstract: Image style transfer models based on convolutional neural networks usually suffer from high temporal inconsistency when applied to videos. Some video style transfer models have been proposed to improve temporal consistency, yet they fail to guarantee fast processing speed, nice perceptual style quality and high temporal consistency at the same time. In this paper, we propose a novel real-time video style transfer model, ReCoNet, which can generate temporally coherent style transfer videos while maintaining favorable perceptual styles. A novel luminance warping constraint is added to the temporal loss at the output level to capture luminance changes between consecutive frames and increase stylization stability under illumination effects. We also propose a novel feature-map-level temporal loss to further enhance temporal consistency on traceable objects. Experimental results indicate that our model exhibits outstanding performance both qualitatively and quantitatively.

Submitted to arXiv on 03 Jul. 2018

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1807.01197v2

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The existing summary discusses the limitations of existing image style transfer models based on convolutional neural networks when applied to videos. These models often suffer from high temporal inconsistency, meaning that the style transfer is not consistent across consecutive frames in a video. While some video style transfer models have been proposed to improve temporal consistency, they often sacrifice either processing speed, perceptual style quality, or both. In response to these challenges, the authors propose a novel real-time video style transfer model called ReCoNet. This model aims to generate temporally coherent style transfer videos while maintaining favorable perceptual styles. To achieve this, the authors introduce two key innovations. Firstly, they add a luminance warping constraint to the temporal loss at the output level. This constraint captures luminance changes between consecutive frames and increases stylization stability under illumination effects. By considering luminance changes, ReCoNet can produce more visually consistent and realistic style transfers. Secondly, the authors propose a feature-map-level temporal loss to further enhance temporal consistency on traceable objects in videos. This loss function ensures that objects with distinct features maintain their consistency throughout the video sequence. Experimental results demonstrate that ReCoNet achieves outstanding performance both qualitatively and quantitatively. The model successfully addresses the limitations of previous approaches by providing fast processing speed, nice perceptual style quality and high temporal consistency simultaneously. Overall, ReCoNet represents an important advancement in real-time video style transfer and its ability to generate temporally coherent and visually appealing stylized videos makes it a valuable tool for various applications such as video editing and artistic expression.
Created on 29 Aug. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.