Abstract

This study investigates the application of Convolutional Neural Networks (CNNs) in the domain of audio feature extraction and classification. Through systematic experimentation, diverse datasets spanning speech, music, and environmental sounds are utilized to train and evaluate CNN models. The statistical results demonstrate the efficacy of CNN-based approaches, with high accuracy, precision, recall, and F1-score achieved across various audio processing tasks, including speech recognition, music genre classification, and environmental sound monitoring. Comparative analysis against baseline models and alternative deep learning architectures reaffirms the superiority of CNNs, showcasing their ability to capture intricate patterns present in audio signals and overcome the limitations of traditional methods. Challenges such as dataset annotation, computational complexity, and robustness to noise are discussed, along with potential avenues for future research. Overall, this study contributes to the advancement of intelligent audio processing systems, highlighting the transformative potential of CNNs in unlocking new dimensions in auditory data analysis and interpretation.

Details

Title
Audio Feature Extraction and Classification Technology Based on Convolutional Neural Network
Author
Liu, Zhenfang 1 

 College of Arts and Sports, Henan Open University, Zhengzhou, Henan, 450046, China 
Pages
1425-1431
Publication year
2024
Publication date
2024
Publisher
Engineering and Scientific Research Groups
e-ISSN
11125209
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3081429578
Copyright
© 2024. This work is published under https://creativecommons.org/licenses/by/4.0/legalcode (the“License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.