Content area

Abstract

To improve the accuracy and precision of gesture recognition, this study improves YOLOv5 by incorporating a coordinate attention mechanism and a bidirectional feature pyramid network. Based on the improved YOLOv5, a static gesture recognition model is constructed. In addition, this study introduces a multimodal inter-frame motion attention weight module to enhance the model’s ability to recognize dynamic gestures. In the performance evaluation experiments, the proposed model achieves an area under the receiver operating characteristic curve of 0.94, a harmonic mean of 96.4%, and an intersection over union of 0.9. The accuracy of static gesture recognition reaches 100%, while the average accuracy of dynamic gesture recognition achieves 95.7%, which significantly outperforms the comparison models. These results demonstrate that the proposed gesture recognition model offers high accuracy for static gestures and reliable recognition performance for dynamic gestures. This approach provides a potential method and perspective for improving human–computer interaction in virtual reality and intelligent assistance scenarios.

Details

1009240
Title
Gesture recognition method integrating multimodal inter-frame motion and shared attention weights
Author
Lu, Qiyuan 1 

 Lanzhou City University, Art and Design School, Lanzhou, China (GRID:grid.464358.8) (ISNI:0000 0004 6479 2641) 
Publication title
Volume
5
Issue
1
Pages
405
Publication year
2025
Publication date
Dec 2025
Publisher
Springer Nature B.V.
Place of publication
Istanbul
Country of publication
Netherlands
e-ISSN
27310809
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-12-29
Milestone dates
2025-11-05 (Registration); 2025-07-29 (Received); 2025-11-05 (Accepted)
Publication history
 
 
   First posting date
29 Dec 2025
ProQuest document ID
3288264917
Document URL
https://www.proquest.com/scholarly-journals/gesture-recognition-method-integrating-multimodal/docview/3288264917/se-2?accountid=208611
Copyright
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-12-30
Database
ProQuest One Academic