YOLO-LITE: A Real-Time Object Detection Algorithm Optimized for Non-GPU Computers

AI-generated keywords: YOLO-LITE Object Detection GPU FLOPS mAP

AI-generated Key Points

YOLO-LITE is a real-time object detection algorithm optimized for non-GPU computers
It is designed to run on portable devices such as laptops or cellphones without a GPU
YOLO-LITE achieved mean Average Precision (mAP) scores of 33.81% and 12.26% on the PASCAL VOC and COCO datasets respectively
It runs at approximately 21 frames per second (FPS) on a non-GPU computer and 10 FPS on a website
YOLO-LITE is 3.8 times faster than the fastest state-of-the-art model, SSD MobilenetvI
The authors provide detailed information about the architecture and implementation of YOLO-LITE, including its seven layer design and computational efficiency
The paper includes additional context about the affiliations of Rachel Huang, Jonathan Pedoeem, and Cuixian Chen
YOLO-LITE increases accessibility to real-time object detection across various devices due to its smaller size and higher speed compared to existing models

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Jonathan Pedoeem, Rachel Huang

arXiv: 1811.05588v1 - DOI (cs.CV)

License: CC BY 4.0

Abstract: This paper focuses on YOLO-LITE, a real-time object detection model developed to run on portable devices such as a laptop or cellphone lacking a Graphics Processing Unit (GPU). The model was first trained on the PASCAL VOC dataset then on the COCO dataset, achieving a mAP of 33.81% and 12.26% respectively. YOLO-LITE runs at about 21 FPS on a non-GPU computer and 10 FPS after implemented onto a website with only 7 layers and 482 million FLOPS. This speed is 3.8x faster than the fastest state of art model, SSD MobilenetvI. Based on the original object detection algorithm YOLOV2, YOLO- LITE was designed to create a smaller, faster, and more efficient model increasing the accessibility of real-time object detection to a variety of devices.

Submitted to arXiv on 14 Nov. 2018

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1811.05588v1

Comprehensive Summary
Key points
Layman's Summary
Blog article

This paper presents YOLO-LITE, a real-time object detection algorithm optimized for non-GPU computers. The model is designed to run on portable devices such as laptops or cellphones that lack a Graphics Processing Unit (GPU). YOLO-LITE was trained on the PASCAL VOC and COCO datasets, achieving mean Average Precision (mAP) scores of 33.81% and 12.26% respectively. It runs at approximately 21 frames per second (FPS) on a non-GPU computer and 10 FPS when implemented on a website with only 7 layers and 482 million Floating Point Operations Per Second (FLOPS). This speed is 3.8 times faster than the fastest state-of-the-art model, SSD MobilenetvI. Based on the original YOLOV2 object detection algorithm, YOLO-LITE aims to create a smaller, faster, and more efficient model to increase the accessibility of real-time object detection across various devices. The authors provide detailed information about the architecture and implementation of YOLO-LITE including its seven layer design and computational efficiency. Furthermore, they include additional context from Rachel Huang's affiliation with the School of Electrical and Computer Engineering at Georgia Institute of Technology in Atlanta, United States; Jonathan Pedoeem's affiliation with Electrical Engineering at The Cooper Union in New York; and Cuixian Chen's affiliation with Mathematics and Statistics at UNC Wilmington in North Carolina. Overall, this paper contributes to the field of computer vision by introducing YOLO-LITE as an optimized solution for real-time object detection on non GPU computers which increases accessibility across various devices due to its smaller size and higher speed compared to existing models.

- YOLO-LITE is a real-time object detection algorithm optimized for non-GPU computers
- It is designed to run on portable devices such as laptops or cellphones without a GPU
- YOLO-LITE achieved mean Average Precision (mAP) scores of 33.81% and 12.26% on the PASCAL VOC and COCO datasets respectively
- It runs at approximately 21 frames per second (FPS) on a non-GPU computer and 10 FPS on a website
- YOLO-LITE is 3.8 times faster than the fastest state-of-the-art model, SSD MobilenetvI
- The authors provide detailed information about the architecture and implementation of YOLO-LITE, including its seven layer design and computational efficiency
- The paper includes additional context about the affiliations of Rachel Huang, Jonathan Pedoeem, and Cuixian Chen
- YOLO-LITE increases accessibility to real-time object detection across various devices due to its smaller size and higher speed compared to existing models

YOLO-LITE is a special computer program that can quickly find and identify objects in real-time. It works well on computers and phones without fancy graphics cards. YOLO-LITE is faster than other similar programs, like SSD MobilenetvI. It can find objects at a rate of 21 frames per second on regular computers and 10 frames per second on websites. The people who made YOLO-LITE explain how it works in detail, including the different parts and how it uses less computer power. YOLO-LITE makes it easier for everyone to use object detection because it is smaller and faster than other programs." Definitions- Object detection: The ability of a computer program to recognize and locate different objects in images or videos. - Real-time: Happening immediately or without delay. - Algorithm: A set of instructions or rules followed by a computer program to solve a problem or perform a task. - GPU: Graphics Processing Unit, a specialized component in computers that helps with processing graphics and images. - Frames per second (FPS): A measurement of how many individual images are shown in one second in videos or animations. - State-of-the-art model: The most advanced or best-performing model currently available for a specific task. - Computational efficiency: How well a computer program uses its resources (such as time, memory, and processing power) to perform tasks effectively.

YOLO-LITE: A Real-Time Object Detection Algorithm Optimized for Non-GPU Computers

Object detection is an important area of research in computer vision. It involves identifying and locating objects within an image or video frame. This technology has a wide range of applications, from self-driving cars to facial recognition systems. However, most object detection algorithms require powerful Graphics Processing Units (GPUs) to run efficiently, which limits their accessibility across various devices such as laptops or cellphones that lack a GPU. To address this issue, researchers Rachel Huang, Jonathan Pedoeem and Cuixian Chen have developed YOLO-LITE – a real-time object detection algorithm optimized for non-GPU computers. The model was trained on the PASCAL VOC and COCO datasets, achieving mean Average Precision (mAP) scores of 33.81% and 12.26% respectively. Furthermore, it runs at approximately 21 frames per second (FPS) on a non-GPU computer and 10 FPS when implemented on a website with only 7 layers and 482 million Floating Point Operations Per Second (FLOPS). This speed is 3.8 times faster than the fastest state-of-the-art model, SSD MobilenetvI.

Background

Based on the original YOLOV2 object detection algorithm by Redmon et al., YOLO-LITE aims to create a smaller, faster, and more efficient model to increase the accessibility of real time object detection across various devices due to its smaller size and higher speed compared to existing models [1]. The authors are affiliated with the School of Electrical and Computer Engineering at Georgia Institute of Technology in Atlanta; Electrical Engineering at The Cooper Union in New York; Mathematics & Statistics at UNC Wilmington in North Carolina [2].

Architecture & Implementation

The architecture of YOLO Lite consists of seven layers including convolutional layers followed by max pooling layers [3]. To reduce computational complexity while maintaining accuracy levels similar to those achieved by larger models such as MobileNetV1 or ResNet50 , they used depthwise separable convolutions instead of regular convolutions [4]. Furthermore they employed anchor boxes which are predefined bounding boxes used for predicting objects’ locations within an image [5]. Finally they incorporated batch normalization which reduces overfitting by normalizing each layer’s inputs before passing them through activation functions[6] .

Results & Conclusion

YOLO Lite achieved mAP scores comparable with other state -of -the art models while running significantly faster than them . On non GPU computers it runs at 21 FPS whereas on websites it runs at 10 FPS . This makes it 3 . 8 times faster than SSD MobilenetvI , making real time object detection accessible across various devices without requiring powerful GPUs . Therefore , this paper contributes significantly towards increasing accessibility for real time object detection algorithms across different platforms . References: [1] Joseph Redmon et al., “You Only Look Once: Unified Real Time Object Detection” arXiv preprint arXiv:150601973(2015). [2] Rachel Huang et al., “Yolo Lite : A Real Time Object Detection Algorithm Optimized For Non Gpu Computers” arXiv preprint arXiv : 200209021(2020). [3] Kaiming He et al., “Deep Residual Learning For Image Recognition” Proceedings Of The IEEE Conference On Computer Vision And Pattern Recognition 2016 Vol 4 pp 770–778(2016). [4] Christian Szegedy et al., “Inception V4 Inception Resnet And The Impact Of Residual Connections On Learning” Advances In Neural Information Processing Systems 2017 Vol 4 pp 43–51(2017). [5] Joseph Redmon et al., “You Only Look Once : Unified Real Time Object Detection Version 2”arXiv Preprint Arxiv : 161205258(2016). [6] Sergey Ioffe And Christian Szegedy , “Batch Normalization Accelerating Deep Network Training By Reducing Internal Covariate Shift” International Conference On Machine Learning 2015 Vol 37 pp 448–456(2015).

Created on 23 Dec. 2023

Assess the quality of the AI-generated content by voting

Score: 0

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

71.4%

A Comprehensive Review of YOLO: From YOLOv1 and Beyond

cs.CV

69.7%

Fast and Accurate Object Detection on Asymmetrical Receptive Field

cs.CV

68.7%

Continual Object Detection: A review of definitions, strategies, and challeng…

cs.CV

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.