Auditory Cortex-Inspired Spectral Attention Modulation for Binaural Sound Localization in HRTF Mismatch - Institut de Recherche et Coordination Acoustique/Musique
Conference Papers, Year: 2024

Auditory Cortex-Inspired Spectral Attention Modulation for Binaural Sound Localization in HRTF Mismatch

Abstract

In applications like noise cancellation and virtual reality, precise sound source localization is crucial. Existing data-driven binaural systems offer high performance in adverse conditions such as noise and reverberation, but they face limitations with real-time operation and degrade in HRTF mismatch scenarios. Our work introduces a compact Vision Transformer tailored to address these issues, with a primary focus on horizontal speech localization. Inspired by the auditory cortex, our model incorporates spectral attention mechanisms using encoded speech representations. This architecture enhances generalization on the azimuth plane under mismatched HRTFs. Our empirical results show a marked improvement over conventional DNN-, CNN-, and Transformer-based models in both noisy and noise-free environments. Notably, the proposed model maintains high accuracy in localizing adjacent azimuths, which makes it well suited to real-world applications.
Main file
ICASSP2024_FINAL.pdf (231.32 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-04416122, version 1 (29-01-2024)

Identifiers

  • HAL Id: hal-04416122, version 1

Cite

Waradon Phokhinanan, Nicolas Obin, Sylvain Argentieri. Auditory Cortex-Inspired Spectral Attention Modulation for Binaural Sound Localization in HRTF Mismatch. International Conference on Acoustics, Speech, and Signal Processing, IEEE, Apr 2024, Seoul, South Korea. ⟨hal-04416122⟩