Full text

Turn on search term navigation

© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Environmental sound recognition might play a crucial role in the development of autonomous vehicles by mimicking human behavior, particularly in complementing sight and touch to create a comprehensive sensory system. Just as humans rely on auditory cues to detect and respond to critical events such as emergency sirens, honking horns, or the approach of other vehicles and pedestrians, autonomous vehicles equipped with advanced sound recognition capabilities may significantly enhance their situational awareness and decision-making processes. To promote this approach, we extended the UrbanSound8K (US8K) dataset, a benchmark in urban sound classification research, by merging some classes deemed irrelevant for autonomous vehicles into a new class named ‘background’ and adding the class ‘silence’ sourced from Freesound.org to complement the dataset. This tailored dataset, named UrbanSound8K for Autonomous Vehicles (US8K_AV), contains 4.94 hours of annotated audio samples with 4,908 WAV files distributed among 6 classes. It supports the development of predictive models that can be deployed in embedded systems like Raspberry Pi.

Details

Title
A dataset for environmental sound recognition in embedded systems for autonomous vehicles
Author
Florentino, André Luiz 1   VIAFID ORCID Logo  ; Diniz, Eva Laussac 2 ; Aquino-Jr, Plinio Thomaz 1 

 Electric Engineering, Centro Universitário FEI - Fundação Educacional Inaciana Pe. Saboia de Medeiros, São Bernardo do Campo, Brazil (GRID:grid.440589.4) (ISNI:0000 0000 8607 7447) 
 Computer Science, UTFPR - Universidade Tecnológica Federal do Paraná, Cornélio Procópio, Brazil (GRID:grid.474682.b) (ISNI:0000 0001 0292 0044) 
Pages
1148
Publication year
2025
Publication date
2025
Publisher
Nature Publishing Group
e-ISSN
20524463
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3227340593
Copyright
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.