Abstract

Computer vision technology for detecting objects in a complex environment often includes other key technologies, including pattern recognition, artificial intelligence, and digital image processing. It has been shown that Fast Convolutional Neural Networks (CNNs) with You Only Look Once (YOLO) is optimal for differentiating similar objects, constant motion, and low image quality. The proposed study aims to resolve these issues by implementing three different object detection algorithms—You Only Look Once (YOLO), Single Stage Detector (SSD), and Faster Region-Based Convolutional Neural Networks (R-CNN). This paper compares three different deep-learning object detection methods to find the best possible combination of feature and accuracy. The R-CNN object detection techniques are performed better than single-stage detectors like Yolo (You Only Look Once) and Single Shot Detector (SSD) in term of accuracy, recall, precision and loss.

Details

Title
An improved deep learning-based optimal object detection system from images
Author
Yadav, Satya Prakash 1 ; Jindal, Muskan 2 ; Rani, Preeti 3 ; de Albuquerque, Victor Hugo C. 4 ; dos Santos Nascimento, Caio 4 ; Kumar, Manoj 5   VIAFID ORCID Logo 

 G.L. Bajaj Institute of Technology and Management (GLBITM), Department of Computer Science and Engineering, Greater Noida, India (GRID:grid.418403.a) (ISNI:0000 0001 0733 9339); Graduate Program in Telecommunications Engineering. (PPGET), Federal Institute of Education, Science, and Technology of Ceará (IFCE), Fortaleza, Brazil (GRID:grid.418403.a) 
 Amity University, Department of Computer Science and Engineering, Noida, India (GRID:grid.444644.2) (ISNI:0000 0004 1805 0217) 
 SRM Institute of Science and Technology, Department of Electronics & Communication Engineering, Modinagar, Ghaziabad, India (GRID:grid.412742.6) (ISNI:0000 0004 0635 5080) 
 Federal University of Ceará, Department of Teleinformatics Engineering, Fortaleza, Brazil (GRID:grid.8395.7) (ISNI:0000 0001 2160 0329) 
 University of Wollongong in Dubai, School of Computer Sceince, FEIS, Dubai, UAE (GRID:grid.444532.0) (ISNI:0000 0004 1763 6152); Middle East University, MEU Research Unit, Amman, Jordan (GRID:grid.449114.d) (ISNI:0000 0004 0457 5303) 
Pages
30045-30072
Publication year
2024
Publication date
Mar 2024
Publisher
Springer Nature B.V.
ISSN
13807501
e-ISSN
15737721
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2941425864
Copyright
© The Author(s) 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.