Tech Science Press - Publisher of Open Access Journals



Search Results (5)

Open Access

ARTICLE

Speech Separation Algorithm Using Gated Recurrent Network Based on Microphone Array

Xiaoyan Zhao¹,*, Lin Zhou², Yue Xie¹, Ying Tong¹, Jingang Shi³

Intelligent Automation & Soft Computing, Vol.36, No.3, pp. 3087-3100, 2023, DOI:10.32604/iasc.2023.030180

Abstract: Speech separation is an active research topic that plays an important role in numerous applications, such as speaker recognition, hearing prostheses, and autonomous robots. Many algorithms have been put forward to improve separation performance. However, speech separation in reverberant, noisy environments remains challenging. To address this, this paper proposes a novel speech separation algorithm that uses a gated recurrent unit (GRU) network based on a microphone array. The main aim of the proposed algorithm is to improve separation performance and reduce computational cost. The proposed algorithm extracts the sub-band steered response power-phase transform (SRP-PHAT) weighted…

Views: 717 | Downloads: 504 | Likes: 0

Open Access

ARTICLE

Microphone Array-Based Sound Source Localization Using Convolutional Residual Network

Ziyi Wang¹, Xiaoyan Zhao¹,*, Hongjun Rong¹, Ying Tong¹, Jingang Shi²

Journal of New Media, Vol.4, No.3, pp. 145-153, 2022, DOI:10.32604/jnm.2022.030178

Abstract: Microphone array-based sound source localization (SSL) is widely used in applications such as video conferencing, robotic hearing, speech enhancement, and speech recognition. Traditional SSL methods cannot achieve satisfactory performance in adverse noisy and reverberant environments. To improve localization performance, this paper proposes a novel SSL algorithm using a convolutional residual network (CRN). Spatial features, including the time differences of arrival (TDOAs) between microphone pairs and the steered response power-phase transform (SRP-PHAT) spatial spectrum, are extracted in each Gammatone sub-band. The spatial features of different sub-bands within a frame are combined into a…

Views: 1116 | Downloads: 690 | Likes: 0

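The abstract above mentions TDOAs between microphone pairs as a spatial feature. As a rough illustration only, the following is a minimal full-band GCC-PHAT sketch of how such a delay is commonly estimated. It is not the authors' implementation: the paper works in Gammatone sub-bands, and the function name `gcc_phat_tdoa` and its parameters are hypothetical choices made for this example.

```python
import numpy as np

def gcc_phat_tdoa(x1, x2, fs, max_tau=None):
    """Estimate the delay of x2 relative to x1, in seconds, with GCC-PHAT.

    Hypothetical helper for illustration; a positive result means x2 lags x1.
    """
    n = len(x1) + len(x2)                 # zero-pad to avoid circular wrap-around
    X1 = np.fft.rfft(x1, n=n)
    X2 = np.fft.rfft(x2, n=n)
    cross = X2 * np.conj(X1)              # cross-power spectrum
    cross /= np.abs(cross) + 1e-12        # PHAT weighting: keep phase, drop magnitude
    cc = np.fft.irfft(cross, n=n)         # generalized cross-correlation

    max_shift = n // 2
    if max_tau is not None:
        max_shift = max(1, min(int(fs * max_tau), max_shift))

    # Reorder so the lags run from -max_shift to +max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    lag = int(np.argmax(np.abs(cc))) - max_shift
    return lag / fs
```

The optional `max_tau` argument restricts the search to physically plausible delays, for example the microphone spacing divided by the speed of sound.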
Open Access

ARTICLE

Robust Sound Source Localization Using Convolutional Neural Network Based on Microphone Array

Xiaoyan Zhao¹,*, Lin Zhou², Ying Tong¹, Yuxiao Qi¹, Jingang Shi³

Intelligent Automation & Soft Computing, Vol.30, No.1, pp. 361-371, 2021, DOI:10.32604/iasc.2021.018823

Abstract: To improve the performance of microphone array-based sound source localization (SSL), this paper proposes a robust SSL algorithm using a convolutional neural network (CNN). The Gammatone sub-band steered response power-phase transform (SRP-PHAT) spatial spectrum is adopted as the localization cue because of the feature correlation between consecutive sub-bands. Since a CNN has the “weight sharing” characteristic and is well suited to processing tensor data, it is adopted to extract spatial location information from the localization cues. The Gammatone sub-band SRP-PHAT spatial spectra are calculated from the microphone signals, which are decomposed in the frequency domain by a Gammatone filter bank. The proposed algorithm…

Views: 1412 | Downloads: 1013 | Likes: 0

Open Access

ARTICLE

Microphone Array Speech Separation Algorithm Based on TC-ResNet

Lin Zhou¹,*, Yue Xu¹, Tianyi Wang¹, Kun Feng¹, Jingang Shi²

CMC-Computers, Materials & Continua, Vol.69, No.2, pp. 2705-2716, 2021, DOI:10.32604/cmc.2021.017080

Abstract: Traditional separation methods have limited ability to handle speech separation in highly reverberant, low signal-to-noise ratio (SNR) environments, and thus achieve unsatisfactory results. In this study, a convolutional neural network with temporal convolution and a residual network (TC-ResNet) is proposed to realize speech separation in complex acoustic environments. A simplified steered-response power phase transform, denoted GSRP-PHAT, is employed to reduce the computational cost. The extracted features are reshaped into a special tensor that serves as the system input, and temporal convolution is applied, which not only enlarges the receptive field of the convolution layer but also significantly reduces the…

Views: 1840 | Downloads: 1278 | Likes: 1

Open Access

ARTICLE

Sound Source Localization Based on SRP-PHAT Spatial Spectrum and Deep Neural Network

Xiaoyan Zhao¹,*, Shuwen Chen², Lin Zhou³, Ying Chen³,⁴

CMC-Computers, Materials & Continua, Vol.64, No.1, pp. 253-271, 2020, DOI:10.32604/cmc.2020.09848

Abstract: Microphone array-based sound source localization (SSL) is a challenging task in adverse acoustic scenarios. To address this, this paper presents a novel SSL algorithm based on a deep neural network (DNN) that uses the steered response power-phase transform (SRP-PHAT) spatial spectrum as the input feature. Since the SRP-PHAT spatial power spectrum contains spatial location information, it is adopted as the input feature for sound source localization. A DNN is exploited to extract efficient location information from the SRP-PHAT spatial power spectrum because of its advantage in extracting high-level features. The SRP-PHAT values at each steering position within a frame are arranged into a vector, which…

Views: 2884 | Downloads: 1402 | Likes: 0 | Cited by: 4
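
The abstract above describes arranging the SRP-PHAT value at each steering position within a frame into a feature vector. The sketch below shows one common way to compute such a vector. It is an illustration under stated assumptions, not the paper's method: it assumes a 2-D far-field model with candidate azimuths as the steering positions, rounds delays to whole samples, and uses the hypothetical names `srp_phat_spectrum`, `mic_xy`, and `azimuths_deg`.

```python
import numpy as np
from itertools import combinations

def srp_phat_spectrum(frames, mic_xy, azimuths_deg, fs, c=343.0):
    """Return one SRP-PHAT value per candidate azimuth for a single frame.

    frames: (num_mics, frame_len) array, one row per microphone.
    mic_xy: (num_mics, 2) microphone coordinates in metres.
    Assumes far-field sources in the array plane (illustrative simplification).
    """
    num_mics, frame_len = frames.shape
    n = 2 * frame_len                                  # zero-padded FFT length
    spectra = np.fft.rfft(frames, n=n, axis=1)

    az = np.deg2rad(np.asarray(azimuths_deg, dtype=float))
    directions = np.stack((np.cos(az), np.sin(az)), axis=1)  # unit vectors toward source

    power = np.zeros(len(az))
    for i, j in combinations(range(num_mics), 2):
        cross = spectra[j] * np.conj(spectra[i])
        cross /= np.abs(cross) + 1e-12                 # PHAT weighting
        cc = np.fft.irfft(cross, n=n)                  # peaks at delay of mic j vs mic i
        # Far-field delay of mic j relative to mic i for each candidate direction.
        tau = (mic_xy[i] - mic_xy[j]) @ directions.T / c
        lags = np.rint(tau * fs).astype(int)
        power += cc[lags % n]                          # negative lags wrap around
    return power
```

The returned array has one entry per candidate azimuth, so it can be used directly as the kind of per-frame input vector the abstract refers to; its largest entry indicates the most likely direction under this simplified model.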

