Full Text

Turn on search term navigation

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Using gestures can help people with certain disabilities in communicating with other people. This paper proposes a lightweight model based on YOLO (You Only Look Once) v3 and DarkNet-53 convolutional neural networks for gesture recognition without additional preprocessing, image filtering, and enhancement of images. The proposed model achieved high accuracy even in a complex environment, and it successfully detected gestures even in low-resolution picture mode. The proposed model was evaluated on a labeled dataset of hand gestures in both Pascal VOC and YOLO format. We achieved better results by extracting features from the hand and recognized hand gestures of our proposed YOLOv3 based model with accuracy, precision, recall, and an F-1 score of 97.68, 94.88, 98.66, and 96.70%, respectively. Further, we compared our model with Single Shot Detector (SSD) and Visual Geometry Group (VGG16), which achieved an accuracy between 82 and 85%. The trained model can be used for real-time detection, both for static hand images and dynamic gestures recorded on a video.

Details

Title
Real-Time Hand Gesture Recognition Based on Deep Learning YOLOv3 Model
Author
Mujahid, Abdullah 1   VIAFID ORCID Logo  ; Mazhar Javed Awan 2   VIAFID ORCID Logo  ; Yasin, Awais 3 ; Mazin Abed Mohammed 4   VIAFID ORCID Logo  ; Damaševičius, Robertas 5   VIAFID ORCID Logo  ; Maskeliūnas, Rytis 6   VIAFID ORCID Logo  ; Karrar Hameed Abdulkareem 7   VIAFID ORCID Logo 

 Department of Computer Science, University of Management and Technology, Lahore 54770, Pakistan; [email protected] 
 Department of Software Engineering, University of Management and Technology, Lahore 54770, Pakistan; [email protected] 
 Department of Computer Engineering, National University of Technology, Islamabad 44000, Pakistan; [email protected] 
 Information Systems Department, College of Computer Science and Information Technology, University of Anbar, Anbar 31001, Iraq; [email protected] 
 Faculty of Applied Mathematics, Silesian University of Technology, 44-100 Gliwice, Poland 
 Department of Applied Informatics, Vytautas Magnus University, 44404 Kaunas, Lithuania; [email protected] 
 College of Agriculture, Al-Muthanna University, Samawah 66001, Iraq; [email protected] 
First page
4164
Publication year
2021
Publication date
2021
Publisher
MDPI AG
e-ISSN
20763417
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2528271092
Copyright
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.