Local-to-Global Panorama Inpainting for Locale-Aware Indoor Lighting Prediction

AI-generated keywords: Panorama Inpainting Convolutional Neural Networks (CNNs) Depth-guided Local Inpainting PanoTransformer Large-scale Panorama Dataset

AI-generated Key Points

Predicting indoor lighting from a single perspective image is challenging in computer vision and graphics.
The problem has been decomposed into three sub-tasks: depth-based image warping, panorama inpainting, and high-dynamic-range (HDR) reconstruction.
Panorama inpainting is critical for achieving locale-aware and robust prediction.
Recent methods rely on convolutional neural networks (CNNs), but they struggle to capture long-distance relationships and spatially-varying distortion in spherical signals.
A local-to-global strategy for large-scale panorama inpainting has been proposed, involving depth-guided local inpainting and a transformer-based network called PanoTransformer.
This pipeline significantly improves the quality of recovered indoor lighting distribution at any locale and enables high fidelity shading on inserted virtual objects.
The proposed transformer based network captures distortion-free global features from distorted signals and restores globally consistent structures accordingly.
A new large scale panorama dataset has also been collected with paired masked input and ground truth images for future research purposes.
Bullet points:
Indoor lighting prediction is challenging
Problem decomposed into 3 sub-tasks: depth-based image warping, panorama inpainting, HDR reconstruction
Panorama inpainting critical for locale-aware & robust prediction
Recent methods rely on CNNs but struggle with long-distance relationships & spatially-varying distortion
Proposed local-to-global strategy involves depth-guided local inpainting & PanoTransformer network
Pipeline improves quality of recovered indoor lighting distribution & enables high fidelity shading on virtual objects
Transformer-based network captures distortion-free global features & restores globally consistent structures
New large-scale panorama dataset collected with masked input & ground truth images

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Jiayang Bai, Zhen He, Shan Yang, Jie Guo, Zhenyu Chen, Yan Zhang, Yanwen Guo

arXiv: 2303.10344v1 - DOI (cs.CV)

10 pages, 11 figures

License: CC ZERO 1.0

Abstract: Predicting panoramic indoor lighting from a single perspective image is a fundamental but highly ill-posed problem in computer vision and graphics. To achieve locale-aware and robust prediction, this problem can be decomposed into three sub-tasks: depth-based image warping, panorama inpainting and high-dynamic-range (HDR) reconstruction, among which the success of panorama inpainting plays a key role. Recent methods mostly rely on convolutional neural networks (CNNs) to fill the missing contents in the warped panorama. However, they usually achieve suboptimal performance since the missing contents occupy a very large portion in the panoramic space while CNNs are plagued by limited receptive fields. The spatially-varying distortion in the spherical signals further increases the difficulty for conventional CNNs. To address these issues, we propose a local-to-global strategy for large-scale panorama inpainting. In our method, a depth-guided local inpainting is first applied on the warped panorama to fill small but dense holes. Then, a transformer-based network, dubbed PanoTransformer, is designed to hallucinate reasonable global structures in the large holes. To avoid distortion, we further employ cubemap projection in our design of PanoTransformer. The high-quality panorama recovered at any locale helps us to capture spatially-varying indoor illumination with physically-plausible global structures and fine details.

Submitted to arXiv on 18 Mar. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2303.10344v1

Comprehensive Summary
Key points
Layman's Summary
Blog article

Predicting panoramic indoor lighting from a single perspective image is a challenging problem in computer vision and graphics. To address this issue, researchers have decomposed it into three sub-tasks: depth-based image warping, panorama inpainting, and high-dynamic-range (HDR) reconstruction. Among these sub-tasks, the success of panorama inpainting plays a critical role in achieving locale-aware and robust prediction. Recent methods for panorama inpainting rely on convolutional neural networks (CNNs) to fill the missing contents in the warped panorama. However, CNNs are limited by their receptive fields and struggle to capture long-distance relationships that are prevalent in panoramas. Additionally, the spatially-varying distortion in spherical signals further increases the difficulty for conventional CNNs. To overcome these issues, a local-to-global strategy for large-scale panorama inpainting has been proposed. This method involves a depth-guided local inpainting applied on the warped panorama to fill small but dense holes caused by pixel stretching during image warping at a given locale. Then, a transformer-based network called PanoTransformer is designed to hallucinate reasonable global structures in the large holes with cubemap projections to avoid distortion. The proposed pipeline significantly improves the quality of recovered indoor lighting distribution at any locale and enables high fidelity and globally coherent shading on inserted virtual objects. Furthermore, it can reproduce fine texture details that are consistent with inserting points on specular surfaces. In summary, this work proposes an effective local-to-global panorama inpainting pipeline that fills missing contents while preserving spatially varying indoor illumination with physically plausible global structures and fine details at any locale. The proposed transformer based network captures distortion free global features from distorted signals and restores globally consistent structures accordingly. A new large scale panorama dataset has also been collected with paired masked input and ground truth images for future research purposes.

- Predicting indoor lighting from a single perspective image is challenging in computer vision and graphics.
- The problem has been decomposed into three sub-tasks: depth-based image warping, panorama inpainting, and high-dynamic-range (HDR) reconstruction.
- Panorama inpainting is critical for achieving locale-aware and robust prediction.
- Recent methods rely on convolutional neural networks (CNNs), but they struggle to capture long-distance relationships and spatially-varying distortion in spherical signals.
- A local-to-global strategy for large-scale panorama inpainting has been proposed, involving depth-guided local inpainting and a transformer-based network called PanoTransformer.
- This pipeline significantly improves the quality of recovered indoor lighting distribution at any locale and enables high fidelity shading on inserted virtual objects.
- The proposed transformer based network captures distortion-free global features from distorted signals and restores globally consistent structures accordingly.
- A new large scale panorama dataset has also been collected with paired masked input and ground truth images for future research purposes.
Bullet points:
- Indoor lighting prediction is challenging
- Problem decomposed into 3 sub-tasks: depth-based image warping, panorama inpainting, HDR reconstruction
- Panorama inpainting critical for locale-aware & robust prediction
- Recent methods rely on CNNs but struggle with long-distance relationships & spatially-varying distortion
- Proposed local-to-global strategy involves depth-guided local inpainting & PanoTransformer network
- Pipeline improves quality of recovered indoor lighting distribution & enables high fidelity shading on virtual objects
- Transformer-based network captures distortion-free global features & restores globally consistent structures
- New large-scale panorama dataset collected with masked input & ground truth images

Indoor lighting prediction is difficult and has been broken down into three sub-tasks: depth-based image warping, panorama inpainting, and HDR reconstruction. Panorama inpainting is important for accurate predictions. Recent methods using CNNs struggle with long-distance relationships and spatial distortion. A new local-to-global strategy involving depth-guided local inpainting and a PanoTransformer network improves the quality of indoor lighting prediction.

Predicting Panoramic Indoor Lighting from a Single Perspective Image

Introduction:

Recent Methods for Panorama Inpainting

Convolutional Neural Networks (CNNs):

Recent methods for panorama inpainting rely on convolutional neural networks (CNNs) to fill the missing contents in the warped panorama. However, CNNs are limited by their receptive fields and struggle to capture long-distance relationships that are prevalent in panoramas. Additionally, the spatially varying distortion in spherical signals further increases the difficulty for conventional CNNs.

Local to Global Strategy

Depth Guided Local Inpainting: To overcome these issues ,a local -to -global strategy for large -scale panorama inpainting has been proposed . This method involves a depth -guided local inpainting applied on the warped panorama to fill small but dense holes caused by pixel stretching during image warping at a given locale . Then ,a transformer -based network called PanoTransformer is designed to hallucinate reasonable global structuresin the large holes with cubemap projections to avoid distortion . The proposed pipeline significantly improves the quality of recovered indoor lighting distribution at any locale and enables high fidelity and globally coherent shading on inserted virtual objects . Furthermore ,it can reproduce fine texture details that are consistent with inserting points on specular surfaces .

Summary

In Summary : This work proposes an effective local -to -globalpanoramain painting pipeline that fills missing contents while preserving spatially varying indoor illumination with physically plausible global structuresand fine detailsat any locale .The proposed transformer based network captures distortion free global featuresfrom distorted signalsand restores globally consistent structures accordingly .A new large scalepanoramadatasethas also been collectedwith paired masked inputand ground truth imagesfor future research purposes.

Created on 29 Mar. 2023

Assess the quality of the AI-generated content by voting

Score: 0

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

67.0%

Layout-guided Indoor Panorama Inpainting with Plane-aware Normalization

cs.CV

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.