Content area

Abstract

Recently, the challenge of the boundary detection of isolated signs in a continuous sign video has been studied by researchers. To enhance the model performance, replace the handcrafted feature extractor, and also consider the hand structure in these models, we propose a deep learning-based approach, including a combination of the Graph Convolutional Network (GCN) and the Transformer models, along with a post-processing mechanism for final boundary detection. More specifically, the proposed approach includes two main steps: Pre-training on the isolated sign videos and Deploying on the continuous sign videos. In the first step, the enriched spatial features obtained from the GCN model are fed to the Transformer model to push the temporal information in the video stream. This model in pre-trained only using the pre-processed isolated sign videos with same frame lengths. During the second step, the sliding window method with the pre-defined window size is moved on the continuous sign video, including the un-processed isolated sign videos with different frame lengths. More concretely, the content of each window is processed using the pre-trained model obtained from the first step and the class probabilities of the Fully Connected (FC) layer embedded in the Transformer model are fed to the post-processing module, which aims to detect the accurate boundary of the un-processed isolated signs. In addition, we propose to present a non-anatomical graph structure to better present the hand joints movements and relations during the signing. Relying on the proposed non-anatomical hand graph structure as well as the self-attention mechanism in the Transformer model, the proposed model can successfully tackle the challenges of boundary detection in continuous sign videos. Experimental results on two datasets show the superiority of the proposed model in dealing with isolated sign boundary detection in continuous sign sequences.

Details

1009240
Title
A non-anatomical graph structure for boundary detection in continuous sign language
Author
Rastgoo, Razieh 1 ; Kiani, Kourosh 1 ; Escalera, Sergio 2 

 Semnan University, Electrical and Computer Engineering Department, Semnan, Iran (GRID:grid.412475.1) (ISNI:0000 0001 0506 807X) 
 University of Barcelona and Computer Vision Center, Department of Mathematics and Informatics, Barcelona, Spain (GRID:grid.5841.8) (ISNI:0000 0004 1937 0247) 
Volume
15
Issue
1
Pages
25683
Publication year
2025
Publication date
2025
Publisher
Nature Publishing Group
Place of publication
London
Country of publication
United States
Publication subject
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-07-16
Milestone dates
2025-07-11 (Registration); 2025-01-10 (Received); 2025-07-11 (Accepted)
Publication history
 
 
   First posting date
16 Jul 2025
ProQuest document ID
3230336632
Document URL
https://www.proquest.com/scholarly-journals/non-anatomical-graph-structure-boundary-detection/docview/3230336632/se-2?accountid=208611
Copyright
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-07-18
Database
ProQuest One Academic