Abstract

Lip language is an effective method of voice-off communication in daily life for people with vocal cord lesions and laryngeal and lingual injuries without occupying the hands. Collection and interpretation of lip language is challenging. Here, we propose the concept of a novel lip-language decoding system with self-powered, low-cost, contact and flexible triboelectric sensors and a well-trained dilated recurrent neural network model based on prototype learning. The structural principle and electrical properties of the flexible sensors are measured and analysed. Lip motions for selected vowels, words, phrases, silent speech and voice speech are collected and compared. The prototype learning model reaches a test accuracy of 94.5% in training 20 classes with 100 samples each. The applications, such as identity recognition to unlock a gate, directional control of a toy car and lip-motion to speech conversion, work well and demonstrate great feasibility and potential. Our work presents a promising way to help people lacking a voice live a convenient life with barrier-free communication and boost their happiness, enriches the diversity of lip-language translation systems and will have potential value in many applications.

Lip-language decoding systems are a promising technology to help people lacking a voice live a convenient life with barrier-free communication. Here, authors propose a concept of such system integrating self-powered triboelectric sensors and a well-trained dilated RNN model based on prototype learning.

Details

Title
Decoding lip language using triboelectric sensors with deep learning
Author
Lu, Yijia 1   VIAFID ORCID Logo  ; Han, Tian 1 ; Cheng, Jia 1   VIAFID ORCID Logo  ; Zhu, Fei 2 ; Liu, Bin 1 ; Wei, Shanshan 1 ; Ji Linhong 1 ; Wang Zhong Lin 3   VIAFID ORCID Logo 

 Tsinghua University, State Key Laboratory of Tribology, Department of Mechanical Engineering, Beijing, China (GRID:grid.12527.33) (ISNI:0000 0001 0662 3178) 
 Chinese Academy of Sciences, National Laboratory of Pattern Recognition, Institute of Automation, Beijing, China (GRID:grid.9227.e) (ISNI:0000000119573309) 
 Chinese Academy of Sciences, Beijing Institute of Nanoenergy and Nanosystems, Beijing, China (GRID:grid.9227.e) (ISNI:0000000119573309); University of Chinese Academy of Sciences, School of Nanoscience and Technology, Beijing, China (GRID:grid.410726.6) (ISNI:0000 0004 1797 8419); Georgia Institute of Technology, School of Materials Science and Engineering, Atlanta, USA (GRID:grid.213917.f) (ISNI:0000 0001 2097 4943) 
Publication year
2022
Publication date
2022
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2640565157
Copyright
© The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.