Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction

AI-generated keywords: Panorama Inpainting, Convolutional Neural Networks (CNNs), Depth-guided Local Inpainting, PanoTransformer, Large-scale Panorama Dataset

AI-generated Key Points

  • Predicting indoor lighting from a single perspective image is challenging in computer vision and graphics.
  • The problem has been decomposed into three sub-tasks: depth-based image warping, panorama inpainting, and high-dynamic-range (HDR) reconstruction.
  • Panorama inpainting is critical for achieving locale-aware and robust prediction.
  • Recent methods rely on convolutional neural networks (CNNs), but they struggle to capture long-distance relationships and spatially-varying distortion in spherical signals.
  • A local-to-global strategy for large-scale panorama inpainting has been proposed, involving depth-guided local inpainting and a transformer-based network called PanoTransformer.
  • This pipeline significantly improves the quality of the recovered indoor lighting distribution at any locale and enables high-fidelity shading on inserted virtual objects.
  • The proposed transformer-based network captures distortion-free global features from distorted signals and restores globally consistent structures accordingly.
  • A new large-scale panorama dataset has also been collected, with paired masked inputs and ground-truth images, for future research.
Bullet points:
  • Indoor lighting prediction is challenging
  • Problem decomposed into 3 sub-tasks: depth-based image warping, panorama inpainting, HDR reconstruction
  • Panorama inpainting critical for locale-aware & robust prediction
  • Recent methods rely on CNNs but struggle with long-distance relationships & spatially-varying distortion
  • Proposed local-to-global strategy involves depth-guided local inpainting & PanoTransformer network
  • Pipeline improves quality of recovered indoor lighting distribution & enables high-fidelity shading on virtual objects
  • Transformer-based network captures distortion-free global features & restores globally consistent structures
  • New large-scale panorama dataset collected with masked input & ground truth images

Authors: Jiayang Bai, Zhen He, Shan Yang, Jie Guo, Zhenyu Chen, Yan Zhang, Yanwen Guo

10 pages, 11 figures
License: CC ZERO 1.0

Abstract: Predicting panoramic indoor lighting from a single perspective image is a fundamental but highly ill-posed problem in computer vision and graphics. To achieve locale-aware and robust prediction, this problem can be decomposed into three sub-tasks: depth-based image warping, panorama inpainting and high-dynamic-range (HDR) reconstruction, among which the success of panorama inpainting plays a key role. Recent methods mostly rely on convolutional neural networks (CNNs) to fill the missing contents in the warped panorama. However, they usually achieve suboptimal performance since the missing contents occupy a very large portion in the panoramic space while CNNs are plagued by limited receptive fields. The spatially-varying distortion in the spherical signals further increases the difficulty for conventional CNNs. To address these issues, we propose a local-to-global strategy for large-scale panorama inpainting. In our method, a depth-guided local inpainting is first applied on the warped panorama to fill small but dense holes. Then, a transformer-based network, dubbed PanoTransformer, is designed to hallucinate reasonable global structures in the large holes. To avoid distortion, we further employ cubemap projection in our design of PanoTransformer. The high-quality panorama recovered at any locale helps us to capture spatially-varying indoor illumination with physically-plausible global structures and fine details.
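The first sub-task named in the abstract, depth-based image warping, can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a pinhole camera with a hypothetical focal length `focal` given in pixels, a y-up frame with the camera looking along -z, depth measured along the optical axis, and nearest-pixel splatting that keeps the closest point per panorama pixel. All function names are illustrative.

```python
import math

def unproject(u, v, depth, width, height, focal):
    """Pixel (u, v) with its depth -> 3D point in the camera frame (assumed: x right, y up, view along -z)."""
    x = (u + 0.5 - width / 2.0) / focal * depth
    y = -(v + 0.5 - height / 2.0) / focal * depth
    return (x, y, -depth)

def to_equirect(point, locale, pano_w, pano_h):
    """3D point seen from `locale` -> (column, row, distance) in an equirectangular panorama."""
    dx, dy, dz = (p - c for p, c in zip(point, locale))
    r = math.sqrt(dx * dx + dy * dy + dz * dz)
    if r == 0.0:
        return None  # the point coincides with the locale, so it has no direction
    lon = math.atan2(dx, -dz)                      # 0 when looking along -z
    lat = math.asin(max(-1.0, min(1.0, dy / r)))   # clamp guards against rounding
    col = (lon / (2.0 * math.pi) + 0.5) * pano_w
    row = (0.5 - lat / math.pi) * pano_h
    return col, row, r

def warp_to_panorama(image, depth, focal, locale, pano_w, pano_h):
    """Splat every input pixel into a panorama centred at `locale`; untouched pixels stay None."""
    height, width = len(image), len(image[0])
    pano = [[None] * pano_w for _ in range(pano_h)]
    nearest = [[math.inf] * pano_w for _ in range(pano_h)]
    for v in range(height):
        for u in range(width):
            point = unproject(u, v, depth[v][u], width, height, focal)
            hit = to_equirect(point, locale, pano_w, pano_h)
            if hit is None:
                continue
            col, row, r = hit
            c = int(col) % pano_w              # wrap around the seam
            rw = min(int(row), pano_h - 1)     # clamp the bottom edge
            if r < nearest[rw][c]:             # keep the closest surface
                nearest[rw][c] = r
                pano[rw][c] = image[v][u]
    return pano

# Toy usage: a 2x2 image at unit depth, warped to an 8x4 panorama at the camera position.
image = [["a", "b"], ["c", "d"]]
depth = [[1.0, 1.0], [1.0, 1.0]]
pano = warp_to_panorama(image, depth, 1.0, (0.0, 0.0, 0.0), 8, 4)
holes = sum(px is None for line in pano for px in line)
print(holes, "of", 8 * 4, "panorama pixels are holes")
```

Pixels left as `None` are the missing content that panorama inpainting has to fill. With a real image and a locale away from the camera, neighbouring input pixels spread apart in the panorama, which produces the small but dense holes the abstract mentions alongside the large unseen regions.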

Submitted to arXiv on 18 Mar. 2023

Results of the summarizing process for the arXiv paper: 2303.10344v1

Predicting panoramic indoor lighting from a single perspective image is a challenging, highly ill-posed problem in computer vision and graphics. To achieve locale-aware and robust prediction, the problem can be decomposed into three sub-tasks: depth-based image warping, panorama inpainting, and high-dynamic-range (HDR) reconstruction. Among these, panorama inpainting plays the key role. Recent methods rely on convolutional neural networks (CNNs) to fill the missing content in the warped panorama. However, the missing content occupies a very large portion of the panorama, and CNNs, limited by their receptive fields, struggle to capture the long-distance relationships that are prevalent in panoramas. The spatially-varying distortion in spherical signals makes the task harder still for conventional CNNs.

To overcome these issues, the authors propose a local-to-global strategy for large-scale panorama inpainting. First, depth-guided local inpainting is applied to the warped panorama to fill the small but dense holes caused by pixel stretching during image warping at a given locale. Then a transformer-based network, PanoTransformer, hallucinates reasonable global structures in the large holes, using cubemap projections to avoid distortion. The network captures distortion-free global features from distorted signals and restores globally consistent structures accordingly. According to the authors, the pipeline significantly improves the quality of the recovered indoor lighting distribution at any locale and enables high-fidelity, globally coherent shading on inserted virtual objects. It can also reproduce fine texture details on specular surfaces that are consistent with the insertion point. A new large-scale panorama dataset with paired masked inputs and ground-truth images has also been collected for future research.

In summary, this work proposes a local-to-global panorama inpainting pipeline that fills missing content and, at any locale, recovers spatially-varying indoor illumination with physically plausible global structures and fine details.
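The role of cubemap projection can be illustrated with a short Python sketch. The paper states only that cubemap projection is employed in PanoTransformer, so everything below is an assumption for illustration: it uses the standard mapping from a viewing direction to one of six cube faces, with a simplified face orientation that does not follow any particular graphics API, and the same y-up, forward -z convention for equirectangular coordinates. The two helper functions compare how unevenly each representation samples the sphere.

```python
import math

def equirect_to_direction(col, row, pano_w, pano_h):
    """Equirectangular pixel coordinates -> unit viewing direction (assumed: y up, forward is -z)."""
    lon = (col / pano_w - 0.5) * 2.0 * math.pi
    lat = (0.5 - row / pano_h) * math.pi
    return (math.cos(lat) * math.sin(lon),
            math.sin(lat),
            -math.cos(lat) * math.cos(lon))

def direction_to_cubemap(d):
    """Direction -> (face, s, t), with s and t in [-1, 1] on the face of the dominant axis."""
    x, y, z = d
    ax, ay, az = abs(x), abs(y), abs(z)
    if max(ax, ay, az) == 0.0:
        raise ValueError("direction must be non-zero")
    if ax >= ay and ax >= az:
        return ("+x" if x > 0 else "-x", z / ax, y / ax)
    if ay >= az:
        return ("+y" if y > 0 else "-y", x / ay, z / ay)
    return ("+z" if z > 0 else "-z", x / az, y / az)

def equirect_stretch(lat):
    """Horizontal stretch of an equirectangular row at latitude `lat`; grows without bound at the poles."""
    return 1.0 / math.cos(lat)

def cube_face_density(s, t):
    """Solid angle per unit face area at (s, t), relative to the face centre."""
    return (1.0 + s * s + t * t) ** -1.5

# Usage: the centre of an 8x4 panorama looks straight ahead and lands on the -z face.
print(direction_to_cubemap(equirect_to_direction(4, 2, 8, 4)))
print(equirect_stretch(math.radians(80)))                      # stretch near the pole
print(cube_face_density(0, 0) / cube_face_density(1, 1))       # worst case on a cube face
```

Each cube face is an ordinary perspective view, so the sampling density varies by a bounded factor between the face centre and its corners, whereas equirectangular rows are stretched more and more towards the poles. This is one way to read the statement that cubemap projection avoids the distortion of the spherical signal.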
Created on 29 Mar. 2023
