Full Text

Turn on search term navigation

© 2019. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

[...]the frequency spectrum of psycho-acoustic metrics is analyzed in terms of critical bands [16] with a frequency bandwidth matching the response of the mentioned auditory filters. [...]the whole database with 60,150 audio segments was divided into three datasets for training, validation, and test. For all the audio segments, we extracted by direct computation their corresponding PA value, to use the computed values as ground-truth annoyance during the training and validation of the system. Since calibration information is missing, we assumed a standard mapping to SPL as typically performed in audio coding. Observing the Convolutional layers and the Max-Pool layers, we see that they have data in the column “Stride”, this indicates us the number of elements that we move the filters before applying them again.

Details

Title
Computation of Psycho-Acoustic Annoyance Using Deep Neural Networks
Author
Lopez-Ballester, Jesus; Pastor-Aparicio, Adolfo; Segura-Garcia, Jaume; Felici-Castell, Santiago; Cobos, Maximo
Publication year
2019
Publication date
Jan 2019
Publisher
MDPI AG
e-ISSN
20763417
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2323136518
Copyright
© 2019. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.