Content area

Abstract

The objective of binaural direction of arrival (DoA) estimation is to find the DoA of a sound source by measuring the sound field with a binaural array. This field increasingly applies deep learning to this task, particularly convolutional neural networks which are trained on relatively raw representations of the binaural audio. 

This work investigates the field, establishing common trends among different publications, particularly in the data preparation, scrutinising these trends for instances of the emergence of collective wisdom without empirical backing. Based on this, an experimental evaluation is performed to gain insight into the efficacy of different existing and novel techniques, based on a recurring testing framework.

Such experimental evaluations are undertaken for several topics: an analysis of acoustic conditions on the performance of binaural DoA estimation, a broad empirical study on binaural feature representations to be used with convolutional neural networks (CNNs), the proposal and comparison of convolutional recurrent neural network (CRNN) models for binaural DoA estimation, and an investigation into binaural DoA estimation in the mismatched anechoic condition; referring to a mismatch in head-related transfer function (HRTF) measurements between training and testing datasets for an identical binaural array.

The findings in this thesis lead to recommendations for more effectively using deep neural networks for binaural DoA estimation, while also demonstrating the limited ability of such systems to generalise to unseen binaural data when using simulated binaural datasets which are limited in their scope.

Details

1010268
Business indexing term
Title
Deep Binaural Direction of Arrival Estimation An Experimental Analysis
Number of pages
256
Publication year
2025
Degree date
2025
School code
8357
Source
DAI-B 87/5(E), Dissertation Abstracts International
ISBN
9798263313890
University/institution
Liverpool John Moores University (United Kingdom)
University location
England
Degree
Ph.D.
Source type
Dissertation or Thesis
Language
English
Document type
Dissertation/Thesis
Dissertation/thesis number
32406742
ProQuest document ID
3273605807
Document URL
https://www.proquest.com/dissertations-theses/deep-binaural-direction-arrival-estimation/docview/3273605807/se-2?accountid=208611
Copyright
Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Database
ProQuest One Academic