Abstract

Several attempts for speech brain–computer interfacing (BCI) have been made to decode phonemes, sub-words, words, or sentences using invasive measurements, such as the electrocorticogram (ECoG), during auditory speech perception, overt speech, or imagined (covert) speech. Decoding sentences from covert speech is a challenging task. Sixteen epilepsy patients with intracranially implanted electrodes participated in this study, and ECoGs were recorded during overt speech and covert speech of eight Japanese sentences, each consisting of three tokens. In particular, Transformer neural network model was applied to decode text sentences from covert speech, which was trained using ECoGs obtained during overt speech. We first examined the proposed Transformer model using the same task for training and testing, and then evaluated the model’s performance when trained with overt task for decoding covert speech. The Transformer model trained on covert speech achieved an average token error rate (TER) of 46.6% for decoding covert speech, whereas the model trained on overt speech achieved a TER of 46.3% (p>0.05;d=0.07). Therefore, the challenge of collecting training data for covert speech can be addressed using overt speech. The performance of covert speech can improve by employing several overt speeches.

Details

Title
Feasibility of decoding covert speech in ECoG with a Transformer trained on overt speech
Author
Komeiji, Shuji 1 ; Mitsuhashi, Takumi 2 ; Iimura, Yasushi 2 ; Suzuki, Hiroharu 2 ; Sugano, Hidenori 2 ; Shinoda, Koichi 3 ; Tanaka, Toshihisa 1 

 Tokyo University of Agriculture and Technology, Department of Electronic and Information Engineering, Koganei-shi, Japan (GRID:grid.136594.c) (ISNI:0000 0001 0689 5974) 
 Juntendo University School of Medicine, Department of Neurosurgery, Bunkyo-ku, Japan (GRID:grid.258269.2) (ISNI:0000 0004 1762 2738) 
 Tokyo Institute of Technology, Department of Computer Science, Meguro-ku, Japan (GRID:grid.32197.3e) (ISNI:0000 0001 2179 2105) 
Pages
11491
Publication year
2024
Publication date
2024
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3056928415
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.