Full Text

Turn on search term navigation

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Speaker identification systems perform almost ideally in neutral talking environments. However, these systems perform poorly in stressful talking environments. In this paper, we present an effective approach for enhancing the performance of speaker identification in stressful talking environments based on a novel radial basis function neural network-convolutional neural network (RBFNN-CNN) model. In this research, we applied our approach to two distinct speech databases: a local Arabic Emirati-accent dataset and a global English Speech Under Simulated and Actual Stress (SUSAS) corpus. To the best of our knowledge, this is the first work that addresses the use of an RBFNN-CNN model in speaker identification under stressful talking environments. Our speech identification models select the finest speech signal representation through the use of Mel-frequency cepstral coefficients (MFCCs) as a feature extraction method. A comparison among traditional classifiers such as support vector machine (SVM), multilayer perceptron (MLP), k-nearest neighbors algorithm (KNN) and deep learning models, such as convolutional neural network (CNN) and recurrent neural network (RNN), was conducted. The results of our experiments show that speaker identification performance in stressful environments based on the RBFNN-CNN model is higher than that with the classical and deep machine learning models.

Details

Title
A Novel RBFNN-CNN Model for Speaker Identification in Stressful Talking Environments
Author
Ali Bou Nassif 1   VIAFID ORCID Logo  ; Alnazzawi, Noha 2   VIAFID ORCID Logo  ; Ismail Shahin 3 ; Salloum, Said A 4 ; Hindawi, Noor 3 ; Lataifeh, Mohammed 5 ; Elnagar, Ashraf 5 

 Computer Engineering Department, University of Sharjah, Sharjah P.O. Box 27272, United Arab Emirates 
 Computer Science and Engineering Department, Yanbu University College, Royal Commission for Jubail and Yanbu, Yanbu Industrial City, Yanbu 46435, Saudi Arabia; [email protected] 
 Electrical Engineering Department, University of Sharjah, Sharjah P.O. Box 27272, United Arab Emirates; [email protected] (I.S.); [email protected] (N.H.) 
 School of Science, Engineering, and Environment, University of Salford, Salford M5 4WT, UK; [email protected] 
 Computer Science Department, University of Sharjah, Sharjah P.O. Box 27272, United Arab Emirates; [email protected] (M.L.); [email protected] (A.E.) 
First page
4841
Publication year
2022
Publication date
2022
Publisher
MDPI AG
e-ISSN
20763417
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2670082091
Copyright
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.