YOLOX: Exceeding YOLO Series in 2021

AI-generated keywords: YOLOX Anchor-Free SimOTA COCO Autonomous Driving

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors present improvements to the YOLO series and introduce a new high-performance detector called YOLOX
  • Anchor-free approach and advanced detection techniques used in YOLOX
  • Achieves state-of-the-art results across various models
  • YOLO-Nano model achieves 25.3% AP on COCO, surpassing NanoDet by 1.8% AP
  • Enhanced YOLOv3 achieves 47.3% AP on COCO, outperforming current best practice by 3.0% AP
  • Introduce YOLOX-L with similar parameters as YOLOv4-CSP and YOLOv5-L, achieving 50.0% AP on COCO at 68.9 FPS on Tesla V100 GPU, surpassing YOLOv5-L by 1.8% AP
  • First place in Streaming Perception Challenge at CVPR 2021 using single YOLOX-L model for autonomous driving applications
  • Deploy versions of YOLOX available with support for ONNX, TensorRT NCNN, OpenVINO, and source code on GitHub
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Zheng Ge, Songtao Liu, Feng Wang, Zeming Li, Jian Sun

Abstract: In this report, we present some experienced improvements to YOLO series, forming a new high-performance detector -- YOLOX. We switch the YOLO detector to an anchor-free manner and conduct other advanced detection techniques, i.e., a decoupled head and the leading label assignment strategy SimOTA to achieve state-of-the-art results across a large scale range of models: For YOLO-Nano with only 0.91M parameters and 1.08G FLOPs, we get 25.3% AP on COCO, surpassing NanoDet by 1.8% AP; for YOLOv3, one of the most widely used detectors in industry, we boost it to 47.3% AP on COCO, outperforming the current best practice by 3.0% AP; for YOLOX-L with roughly the same amount of parameters as YOLOv4-CSP, YOLOv5-L, we achieve 50.0% AP on COCO at a speed of 68.9 FPS on Tesla V100, exceeding YOLOv5-L by 1.8% AP. Further, we won the 1st Place on Streaming Perception Challenge (Workshop on Autonomous Driving at CVPR 2021) using a single YOLOX-L model. We hope this report can provide useful experience for developers and researchers in practical scenes, and we also provide deploy versions with ONNX, TensorRT, NCNN, and Openvino supported. Source code is at https://github.com/Megvii-BaseDetection/YOLOX.

Submitted to arXiv on 18 Jul. 2021

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2107.08430v2

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In this report titled "YOLOX: Exceeding YOLO Series in 2021," authors Zheng Ge, Songtao Liu, Feng Wang, Zeming Li, and Jian Sun present their experienced improvements to the YOLO series and introduce a new high-performance detector called YOLOX. The researchers adopt an anchor-free approach for the YOLO detector and employ advanced detection techniques such as a decoupled head and the leading label assignment strategy SimOTA. These enhancements enable them to achieve state-of-the-art results across a wide range of models. The authors evaluate the performance of YOLOX on various models. For instance, they demonstrate that with only 0.91M parameters and 1.08G FLOPs, their YOLO-Nano model achieves an impressive 25.3% AP (average precision) on COCO (Common Objects in Context) dataset, surpassing NanoDet by 1.8% AP. They also enhance the widely used YOLOv3 detector to achieve a remarkable 47.3% AP on COCO, outperforming the current best practice by 3.0% AP. Furthermore, the researchers introduce YOLOX-L which has approximately the same number of parameters as YOLOv4-CSP and YOLOv5-L models. They achieve outstanding results with this model by attaining a 50.0% AP on COCO while maintaining a speed of 68.9 FPS (frames per second) on Tesla V100 GPU; surpassing YOLOv5-L by 1.8% AP. The authors highlight their success in winning the first place in the Streaming Perception Challenge at CVPR 2021 using a single YOLOX-L model for autonomous driving applications. Overall, this report provides valuable insights and experiences for developers and researchers working in practical scenes related to object detection; offering deploy versions of YOLOX with support for ONNX, TensorRT NCNN and OpenVINO as well as source code available on GitHub at https://github/MegviiBaseDetection/Yolox .
Created on 31 Jul. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.