Open Access


Binaural Sound Source Localization Based on Convolutional Neural Network

Lin Zhou1,*, Kangyu Ma1, Lijie Wang1, Ying Chen1,2, Yibin Tang3

1 School of Information Science and Engineering, Southeast University, Nanjing, 210096, China.
2 Department of Psychiatry, Columbia University and NYSPI, New York, 10032, USA.
3 College of Internet of Things Engineering, Hohai University, Changzhou, 213022, China.

* Corresponding Author: Lin Zhou.

Computers, Materials & Continua 2019, 60(2), 545-557.


Binaural sound source localization (BSSL) in environments with low signal-to-noise ratio (SNR) and high reverberation remains a challenging task. This paper proposes a novel BSSL algorithm based on a convolutional neural network (CNN). The algorithm first extracts a spatial feature from each sub-band of the binaural signal, then combines the features of all sub-bands within one frame into a two-dimensional feature matrix that is treated as a grey image. To exploit the CNN's ability to extract high-level features from such images, the spatial feature matrix of each frame is used as the input for training the CNN model. The trained CNN then predicts the azimuth of the sound source. Experiments show that the proposed algorithm significantly improves localization performance in various acoustic environments, especially under low-SNR and high-reverberation conditions.
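The feature-assembly step described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not name the spatial feature, the filterbank, or the CNN architecture, so the sketch assumes equal-width FFT sub-bands and a peak-normalized inter-channel cross-correlation as the per-band feature. The names `subband_feature_matrix`, `n_bands`, and `max_lag` are hypothetical, and the CNN itself is omitted.

```python
import numpy as np


def subband_feature_matrix(left, right, n_bands=8, max_lag=16):
    """Build an (n_bands, 2*max_lag + 1) matrix for one binaural frame.

    Assumptions (not taken from the paper): sub-bands are equal-width
    FFT bands, and the spatial feature of each band is the cross-correlation
    between the two channels at lags -max_lag..+max_lag, scaled by its peak.
    Both channels must have the same length, with 1 <= max_lag < len(left).
    """
    left = np.asarray(left, dtype=float)
    right = np.asarray(right, dtype=float)
    n_fft = 2 * len(left)  # zero-pad so circular correlation acts as linear
    spec_l = np.fft.rfft(left, n_fft)
    spec_r = np.fft.rfft(right, n_fft)
    # Bin indices that split the spectrum into n_bands contiguous bands.
    edges = np.linspace(0, len(spec_l), n_bands + 1).astype(int)

    rows = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        # Cross-spectrum restricted to the current sub-band.
        cross = np.zeros_like(spec_l)
        cross[lo:hi] = spec_l[lo:hi] * np.conj(spec_r[lo:hi])
        cc = np.fft.irfft(cross, n_fft)
        # Reorder so that column 0 is lag -max_lag and the centre is lag 0.
        cc = np.concatenate((cc[-max_lag:], cc[:max_lag + 1]))
        rows.append(cc / (np.max(np.abs(cc)) + 1e-12))
    # One row per sub-band: a 2-D array that can be viewed as a grey image.
    return np.stack(rows)
```

With this sign convention, a right channel that lags the left channel by d samples yields a peak near column `max_lag - d` in each row. A matrix of this kind, computed per frame, is the sort of two-dimensional input a CNN classifier over azimuth classes could be trained on.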


Cite This Article

L. Zhou, K. Ma, L. Wang, Y. Chen and Y. Tang, "Binaural sound source localization based on convolutional neural network," Computers, Materials & Continua, vol. 60, no. 2, pp. 545–557, 2019.


This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.