SFSORT: Scene Features-based Simple Online Real-Time Tracker

AI-generated keywords: SFSORT Multi-object tracking Tracking-by-detection Bounding Box Similarity Index Scene features

AI-generated Key Points

  • SFSORT is introduced as the world's fastest multi-object tracking system based on experiments conducted on MOT Challenge datasets
  • The research aims to develop an accurate and computationally efficient tracker using a tracking-by-detection method within the online real-time tracking framework
  • Introduction of the novel cost function called the Bounding Box Similarity Index eliminates the need for Kalman Filter, reducing computational requirements while maintaining tracking accuracy
  • Exploration of scene features like scene depth and camera motion to enhance object-track association and improve track post-processing
  • SFSORT system comprises four main components: object detector, modules for associating high-score and moderate-score detections, and a track management module
  • Utilization of YOLOX as the object detector model ensures high tracking accuracy
  • Introduction of a camera motion detector and an efficient metric for estimating scene depth to enhance post-processing of tracks
  • Impressive performance metrics achieved by SFSORT on MOT Challenge datasets: HOTA of 61.7% with processing speed of 2242 Hz on MOT17 dataset and 60.9% with processing speed of 304 Hz on MOT20 dataset
  • First paper to consider scene features in track post-processing by introducing innovative techniques for camera motion detection and depth estimation
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: M. M. Morsali, Z. Sharifi, F. Fallah, S. Hashembeiki, H. Mohammadzade, S. Bagheri Shouraki

License: CC BY 4.0

Abstract: This paper introduces SFSORT, the world's fastest multi-object tracking system based on experiments conducted on MOT Challenge datasets. To achieve an accurate and computationally efficient tracker, this paper employs a tracking-by-detection method, following the online real-time tracking approach established in prior literature. By introducing a novel cost function called the Bounding Box Similarity Index, this work eliminates the Kalman Filter, leading to reduced computational requirements. Additionally, this paper demonstrates the impact of scene features on enhancing object-track association and improving track post-processing. Using a 2.2 GHz Intel Xeon CPU, the proposed method achieves an HOTA of 61.7\% with a processing speed of 2242 Hz on the MOT17 dataset and an HOTA of 60.9\% with a processing speed of 304 Hz on the MOT20 dataset. The tracker's source code, fine-tuned object detection model, and tutorials are available at \url{https://github.com/gitmehrdad/SFSORT}.

Submitted to arXiv on 11 Apr. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2404.07553v1

In this paper titled "SFSORT: Scene Features-based Simple Online Real-Time Tracker," authors M. M. Morsali, Z. Sharifi, F. Fallah, S. Hashembeiki, H. Mohammadzade, and S. Bagheri Shouraki introduce SFSORT as the world's fastest multi-object tracking system based on experiments conducted on MOT Challenge datasets. The primary goal of this research is to develop an accurate and computationally efficient tracker by employing a tracking-by-detection method within the online real-time tracking framework. One of the key contributions of this work is the introduction of a novel cost function called the Bounding Box Similarity Index which eliminates the need for the Kalman Filter and significantly reduces computational requirements while maintaining tracking accuracy. Additionally, this paper explores the impact of scene features such as scene depth and camera motion on enhancing object-track association and improving track post-processing. The proposed SFSORT system consists of four main components: an object detector, modules for associating high-score and moderate-score detections, and a track management module. By processing information from frame T along with tracks from frame T-1, SFSORT generates a list of tracks for each frame in real-time. The object detector used in this system is based on YOLOX, a state-of-the-art object detection model that ensures high tracking accuracy. The authors also introduce a camera motion detector and an efficient metric for estimating scene depth to enhance post-processing of tracks. Experimental results demonstrate that SFSORT achieves impressive performance metrics on MOT Challenge datasets - achieving an HOTA (Higher Order Tracking Accuracy) of 61.7% with a processing speed of 2242 Hz on the MOT17 dataset and 60.9% with a processing speed of 304 Hz on the MOT20 dataset. Furthermore, this paper is significant as it is the first to consider scene features in track post-processing by introducing innovative techniques for camera motion detection and depth estimation. In conclusion, "SFSORT: Scene Features-based Simple Online Real-Time Tracker" presents a cutting-edge approach to multi-object tracking that combines advanced object detection models with novel cost functions and scene feature analysis to achieve state-of-the-art performance in terms of both accuracy and computational efficiency.
Created on 17 Aug. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.