Comparison of You Only Look Once (YOLO) and Single Shot Multibox Detector (SSD) Methods for Object Detection Using OpenCV
Keywords:
Object Detection, YOLO, SSD, OpenCV, Real-Time, Accuracy, Processing Speed, Resource EfficiencyAbstract
Object detection is a crucial field in image processing and computer vision, playing a vital role in applications such as security, industrial automation, human-computer interaction, and autonomous vehicles. As the demand grows for fast, accurate, and real-time systems, deep learning-based object detection methods have become a primary choice. This study aims to evaluate and compare the performance of two popular methods, You Only Look Once (YOLO) and Single Shot MultiBox Detector (SSD), using OpenCV as the development platform. The evaluation focuses on key parameters, including detection accuracy, processing speed, resource efficiency, and real-time applicability. The COCO dataset was used for testing, as it offers diverse object categories with variations in lighting conditions, background, and object sizes. Pre-trained YOLOv3 and SSD models were implemented using Python. The results indicate that YOLO achieves higher processing speed, with an average frame rate of up to 45 fps, making it ideal for fast-response applications such as surveillance systems. Conversely, SSD demonstrates better accuracy in detecting small objects, although its processing speed is slightly lower than YOLO. Further analysis reveals that YOLO relies heavily on GPU for optimizing processing speed, whereas SSD depends more on CPU, making it suitable for devices with limited resources. This study provides valuable insights for developers and researchers in selecting the most appropriate object detection method for specific needs, whether for real-time applications or scenarios requiring high accuracy. Understanding the strengths and limitations of each method enables optimized object detection technology for various future applications.References
Redmon, J., & Farhadi, A., "YOLOv3: An Incremental Improvement," 2018. [Online]. Available: http://arxiv.org/abs/1804.02767.
I.S. Dewi, (2016). "Deep Learning Object Detection Pada Video Menggunakan Tensorflow Dan Convolutional Neural Network", pp. 1-60.
Bochkovskiy, A., Wang, C. Y., Liao, H. Y. M., (2020). "YOLOv4: Optimal Speed and Accuracy of Object Detection", arXiv:2004.1093v1.
Baojun Zhang, Guili Wang , Huilan Wang, Chenchen Xu, Yu Li, dan Lin Xu, (2021). "Detecting Small Chinese Traffic Signs via Improved," Hindawi Mathematical Problems in Engineering, p. 10.
Redmon, J., Divvala, S., Girshick, R., Farhadi, A., (2016). You Only Look Once: Unified, RealTime Object Detection, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779-788.
Adidama, M. R., Fadhil, M. R., & Samsinar, R. (2025). Deteksi Barang Hilang Berbasis Deteksi Visual dengan Metode You Only Look Once. In Seminar Nasional Teknologi & Sains (Vol. 4, No. 1, pp. 653-659)..
H. M. Alfarisi, (2020). "Yolov4: Teknologi Terbaru dalam Perkembangan Algoritma Object Detection,"
Gautama, T. K., Hendrik, A., & Hendaya, R. (2020), "Pengenalan Objek pada Computer Vision dengan Pencocokan Fitur Menggunakan Algoritma SIFT Studi Kasus: Deteksi Penyakit Kulit Sederhana," Jurnal Teknik Informatika Dan Sistem Informasi, pp. 437-450.
Muchtar, Husnibes; Rizky, Apriyadi, (2019). "Implementasi Pengenalan Wajah Pada Sistem Penguncian Rumah dengan Metode Template Matching Menggunakan Open Source Computer Vision Library (Opencv)," RESISTOR (elektRonika kEndali telekomunikaSI tenaga liSTrik kOmputeR) Vol. 2 No. 1.