ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM

AI-generated keywords: ORB-SLAM3 SLAM Visual-Inertial Multi-Map Accuracy

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • ORB-SLAM3 is an advanced open-source library for simultaneous localization and mapping (SLAM) systems.
  • It supports visual, visual-inertial, and multi-map SLAM using various camera types.
  • The system operates in real time across different environments.
  • ORB-SLAM3 is 2 to 5 times more accurate than previous approaches.
  • It incorporates a multiple map system built on a new place recognition method with improved recall.
  • When tracking is lost, ORB-SLAM3 starts a new map and seamlessly merges it with previous maps when a mapped area is revisited.
  • Co-visible keyframes are included in bundle adjustment, providing high-parallax observations that boost accuracy.
  • Experimental evaluations show that ORB-SLAM3 is as robust as the best systems in the literature and significantly more accurate.
  • The stereo-inertial SLAM implementation of ORB-SLAM3 is suited to augmented reality and virtual reality scenarios.
  • The source code of ORB-SLAM3 is publicly available for researchers and practitioners.

Authors: Carlos Campos, Richard Elvira, Juan J. Gómez Rodríguez, José M. M. Montiel, Juan D. Tardós

Abstract: This paper presents ORB-SLAM3, the first system able to perform visual, visual-inertial and multi-map SLAM with monocular, stereo and RGB-D cameras, using pin-hole and fisheye lens models. The first main novelty is a feature-based tightly-integrated visual-inertial SLAM system that fully relies on Maximum-a-Posteriori (MAP) estimation, even during the IMU initialization phase. The result is a system that operates robustly in real-time, in small and large, indoor and outdoor environments, and is 2 to 5 times more accurate than previous approaches. The second main novelty is a multiple map system that relies on a new place recognition method with improved recall. Thanks to it, ORB-SLAM3 is able to survive to long periods of poor visual information: when it gets lost, it starts a new map that will be seamlessly merged with previous maps when revisiting mapped areas. Compared with visual odometry systems that only use information from the last few seconds, ORB-SLAM3 is the first system able to reuse in all the algorithm stages all previous information. This allows to include in bundle adjustment co-visible keyframes, that provide high parallax observations boosting accuracy, even if they are widely separated in time or if they come from a previous mapping session. Our experiments show that, in all sensor configurations, ORB-SLAM3 is as robust as the best systems available in the literature, and significantly more accurate. Notably, our stereo-inertial SLAM achieves an average accuracy of 3.6 cm on the EuRoC drone and 9 mm under quick hand-held motions in the room of TUM-VI dataset, a setting representative of AR/VR scenarios. For the benefit of the community we make public the source code.
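Accuracy figures such as the 3.6 cm and 9 mm quoted in the abstract are commonly reported in SLAM evaluations as the root-mean-square absolute trajectory error (ATE). The abstract itself only says "average accuracy", so the choice of metric here is an assumption. The following minimal sketch shows how such a number could be computed, assuming the estimated and ground-truth trajectories are already aligned and associated by timestamp. The function name `ate_rmse` and the toy positions are hypothetical and do not come from the paper or its code.

```python
import math


def ate_rmse(estimated, ground_truth):
    """RMSE of the position error between two trajectories.

    Assumes both trajectories are already expressed in the same frame
    and matched pose-by-pose (no alignment is performed here).
    """
    if len(estimated) != len(ground_truth) or not estimated:
        raise ValueError("trajectories must be non-empty and of equal length")
    # Squared Euclidean distance between each pair of matched positions.
    squared_errors = [
        sum((e - g) ** 2 for e, g in zip(est, gt))
        for est, gt in zip(estimated, ground_truth)
    ]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Toy positions in metres (made-up values, not taken from any dataset).
ground_truth = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
estimated = [(0.0, 0.03, 0.0), (1.0, 0.0, 0.04), (2.05, 0.0, 0.0)]

error_m = ate_rmse(estimated, ground_truth)
print(f"ATE RMSE: {error_m * 100:.1f} cm")
```

A full evaluation would first align the two trajectories (for example with a rigid-body or similarity transform) before computing the error; that step is left out to keep the sketch short.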

Submitted to arXiv on 23 Jul. 2020

Results of the summarizing process for the arXiv paper: 2007.11898v2

ORB-SLAM3 is an open-source library for simultaneous localization and mapping (SLAM) that introduces several novel features to improve accuracy and robustness. It supports visual, visual-inertial, and multi-map SLAM with monocular, stereo, and RGB-D cameras, using both pinhole and fisheye lens models.

The first major innovation is a feature-based, tightly integrated visual-inertial SLAM system that relies on Maximum-a-Posteriori (MAP) estimation throughout, even during the initialization phase of the inertial measurement unit (IMU). As a result, the system operates robustly in real time in small and large, indoor and outdoor environments, and is 2 to 5 times more accurate than previous approaches.

The second is a multiple map system built on a new place recognition method with improved recall. It lets ORB-SLAM3 survive long periods of poor visual information: when tracking is lost, the system starts a new map, which is seamlessly merged with previous maps when a mapped area is revisited. Unlike visual odometry systems, which use only information from the last few seconds, ORB-SLAM3 can reuse all previous information at every stage of the algorithm. Bundle adjustment can therefore include co-visible keyframes, which provide high-parallax observations that boost accuracy, even when those keyframes are widely separated in time or come from a previous mapping session.

Experimental evaluations show that, across all sensor configurations, ORB-SLAM3 is as robust as the best systems in the literature and significantly more accurate. Notably, its stereo-inertial SLAM achieves an average accuracy of 3.6 cm on the EuRoC drone dataset and 9 mm under quick hand-held motions in the room of the TUM-VI dataset, a setting representative of augmented reality (AR) and virtual reality (VR) scenarios. The authors have made the source code of ORB-SLAM3 publicly available for the benefit of the community.
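The multi-map behaviour described in this summary (start a new map when tracking is lost, then merge it with an earlier map once a known place is recognized) can be illustrated with a toy sketch. This is not ORB-SLAM3's implementation: the class names `Map` and `MultiMap`, the method names, and the use of plain string labels in place of real keyframes and place recognition are all hypothetical simplifications chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Map:
    map_id: int
    # String place labels stand in for keyframes (an assumption of this sketch).
    keyframes: list = field(default_factory=list)


class MultiMap:
    """Toy container for several maps, one of which is active."""

    def __init__(self):
        self.maps = [Map(0)]
        self.active = self.maps[0]
        self._next_id = 1

    def on_tracking_lost(self):
        """Keep the old map stored and start a fresh active map."""
        self.active = Map(self._next_id)
        self._next_id += 1
        self.maps.append(self.active)

    def add_keyframe(self, place):
        """Add a keyframe; merge if the place already exists in another map."""
        self.active.keyframes.append(place)
        for other in self.maps:
            # Exact label matching stands in for real place recognition.
            if other is not self.active and place in other.keyframes:
                self._merge(other)
                break

    def _merge(self, other):
        # Toy merge: concatenate keyframes and drop duplicates. A real system
        # would also align the two maps and refine the result.
        self.active.keyframes = other.keyframes + [
            k for k in self.active.keyframes if k not in other.keyframes
        ]
        self.maps.remove(other)


multi_map = MultiMap()
for place in ["hall", "kitchen"]:
    multi_map.add_keyframe(place)

multi_map.on_tracking_lost()           # e.g. a long period of poor visual information
for place in ["garden", "kitchen"]:    # "kitchen" was already mapped earlier
    multi_map.add_keyframe(place)

print(len(multi_map.maps), multi_map.active.keyframes)
```

After the revisit, the two maps have collapsed into one that contains the keyframes of both sessions, which is what allows older observations to be reused later.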
Created on 19 Nov. 2023
