RomniStereo: Recurrent Omnidirectional Stereo Matching

AI-generated keywords: Depth Sensing

AI-generated Key Points

  • **Omnidirectional Stereo Matching (OSM)** is crucial for providing accurate $360^{\circ}$ depth information.
  • Prior state-of-the-art OSM methods rely on a 3D encoder-decoder block to regularize the cost volume, which complicates the system and yields sub-optimal results.
  • The new algorithm **Recurrent Omnidirectional Stereo Matching (RomniStereo)** bridges the gap between OSM and RAFT with an opposite adaptive weighting scheme, and adds two further techniques: grid embedding and adaptive context feature generation.
  • RomniStereo outperforms previous methods by improving the average Mean Absolute Error metric by 40.7% across five datasets.
  • RomniStereo produces more accurate depth maps with fewer artifacts than methods like OmniMVS+ on datasets such as OmniThings and OmniHouse; in real-world scenes its depth maps are cleaner and more accurate, especially in close-range regions, which are crucial for robot navigation.

Authors: Hualie Jiang, Rui Xu, Minglang Tan, Wenjie Jiang

accepted by IEEE RA-L, https://github.com/HalleyJiang/RomniStereo
License: CC BY-NC-SA 4.0

Abstract: Omnidirectional stereo matching (OSM) is an essential and reliable means for $360^{\circ}$ depth sensing. However, following earlier works on conventional stereo matching, prior state-of-the-art (SOTA) methods rely on a 3D encoder-decoder block to regularize the cost volume, which makes the whole system complicated and the results sub-optimal. Recently, approaches based on Recurrent All-pairs Field Transforms (RAFT) have employed recurrent updates in 2D and efficiently improved image-matching tasks, i.e., optical flow and stereo matching. To bridge the gap between OSM and RAFT, we mainly propose an opposite adaptive weighting scheme to seamlessly transform the outputs of spherical sweeping of OSM into the required inputs for the recurrent update, thus creating a recurrent omnidirectional stereo matching (RomniStereo) algorithm. Furthermore, we introduce two techniques, i.e., grid embedding and adaptive context feature generation, which also contribute to RomniStereo's performance. Our best model improves the average MAE metric by 40.7% over the previous SOTA baseline across five datasets. When visualizing the results, our models demonstrate clear advantages on both synthetic and realistic examples. The code is available at https://github.com/HalleyJiang/RomniStereo.
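To make the headline number concrete, here is a minimal sketch of how a mean absolute error and a relative improvement between two models could be computed. It assumes the 40.7% figure is a relative reduction of the error against the baseline; the function names and all sample values are hypothetical and are not taken from the paper or its repository.

```python
def mean_absolute_error(pred, target):
    """Average absolute difference between predictions and ground truth."""
    if len(pred) != len(target) or not pred:
        raise ValueError("pred and target must be non-empty and equally long")
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def relative_improvement(baseline_error, new_error):
    """Percentage by which new_error is lower than baseline_error.

    Assumption: 'improvement' means relative error reduction.
    """
    return 100.0 * (baseline_error - new_error) / baseline_error


# Hypothetical inverse-depth values for four pixels (illustration only).
target = [0.50, 0.25, 0.10, 0.05]
baseline_pred = [0.54, 0.21, 0.14, 0.09]
new_pred = [0.52, 0.23, 0.12, 0.07]

baseline_mae = mean_absolute_error(baseline_pred, target)
new_mae = mean_absolute_error(new_pred, target)
improvement = relative_improvement(baseline_mae, new_mae)
print(f"baseline MAE={baseline_mae:.3f}, new MAE={new_mae:.3f}, "
      f"improvement={improvement:.1f}%")
```

With these made-up values the error is halved, so the reported improvement is about 50%. The paper's own metric definition and per-dataset averaging may differ from this sketch.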

Submitted to arXiv on 09 Jan. 2024


Results of the summarizing process for the arXiv paper: 2401.04345v1

In the field of depth sensing, <kw>Omnidirectional Stereo Matching (OSM)</kw> plays a crucial role in providing accurate and reliable $360^{\circ}$ depth information. However, prior state-of-the-art (SOTA) OSM methods follow conventional stereo matching and rely on a 3D encoder-decoder block to regularize the cost volume, which complicates the system and leads to sub-optimal results. Recent approaches based on <kw>Recurrent All-pairs Field Transforms (RAFT)</kw> have shown significant improvements in image-matching tasks like optical flow and stereo matching by employing recurrent updates in 2D. To bridge the gap between OSM and RAFT, a new algorithm called <kw>Recurrent Omnidirectional Stereo Matching (RomniStereo)</kw> is proposed. Its main contribution is an opposite adaptive weighting scheme that transforms the outputs of OSM's spherical sweeping into the inputs required by the recurrent update. Additionally, RomniStereo incorporates two further techniques, grid embedding and adaptive context feature generation, which also contribute to its performance. The best RomniStereo model improves the average Mean Absolute Error (<kw>MAE</kw>) metric by 40.7% over the previous SOTA baseline across five datasets. Visualizations of the results demonstrate clear advantages of RomniStereo on both synthetic and realistic examples. The code for RomniStereo is publicly available at https://github.com/HalleyJiang/RomniStereo. Furthermore, qualitative comparisons show that RomniStereo produces more accurate depth maps with fewer artifacts compared to other methods like OmniMVS+ on datasets such as OmniThings and OmniHouse. In real-world scenarios, RomniStereo produces cleaner and more accurate depth maps, especially in close-range regions, which are crucial for robot navigation.
Overall, RomniStereo brings RAFT-style 2D recurrent updates to <kw>omnidirectional stereo matching</kw>, avoiding the 3D encoder-decoder regularization of earlier methods while improving depth accuracy.
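As a rough illustration of the spherical sweeping step mentioned above, the sketch below generates candidate 3D points along one panorama ray at several inverse-depth hypotheses. It is a generic, simplified version under stated assumptions: an equirectangular output panorama, a y-up axis convention, and uniformly spaced inverse depths. All function names and parameters are hypothetical, and this is not the authors' implementation.

```python
import math


def equirect_ray(u, v, width, height):
    """Unit ray for pixel (u, v) of an equirectangular panorama.

    Assumed convention: azimuth spans [-pi, pi) left to right,
    elevation spans [pi/2, -pi/2] top to bottom, y axis points up.
    """
    theta = (u + 0.5) / width * 2.0 * math.pi - math.pi
    phi = math.pi / 2.0 - (v + 0.5) / height * math.pi
    return (math.cos(phi) * math.sin(theta),
            math.sin(phi),
            math.cos(phi) * math.cos(theta))


def inverse_depth_hypotheses(min_depth, num):
    """Uniformly spaced inverse depths in (0, 1/min_depth].

    Zero (infinite depth) is skipped to avoid division by zero.
    """
    max_inv = 1.0 / min_depth
    return [max_inv * (i + 1) / num for i in range(num)]


def sweep_points(ray, inv_depths):
    """Candidate 3D points on spheres of radius 1/d along one ray."""
    return [tuple(c / d for c in ray) for d in inv_depths]


# Example: the ray through the panorama centre, four hypotheses.
ray = equirect_ray(3.5, 1.5, width=8, height=4)
candidates = sweep_points(ray, inverse_depth_hypotheses(0.5, 4))
for point in candidates:
    print(point)
```

In a full pipeline, each candidate point would be projected into every camera to sample image features, and the sampled features would then be combined into a matching volume. How RomniStereo weights and merges those features is specific to the paper and is not reproduced here.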
Created on 17 Jun. 2024


