Dual-Mic Speech Enhancement Based on TF-GSC with Leakage Suppression and Signal Recovery

Hansol Kim and Jong Won Shin

Resumen

The transfer function-generalized sidelobe canceller (TF-GSC) is one of the most popular structures for the adaptive beamformer used in multi-channel speech enhancement. Although the TF-GSC has shown decent performance, a certain amount of steering error is inevitable, which causes leakage of speech components through the blocking matrix (BM) and distortion in the fixed beamformer (FBF) output. In this paper, we propose to suppress the leaked signal in the output of the BM and restore the desired signal in the FBF output of the TF-GSC. To reduce the risk of attenuating speech in the adaptive noise canceller (ANC), the speech component in the output of the BM is suppressed by applying a gain function similar to the square-root Wiener filter, assuming that a certain portion of the desired speech should be leaked into the BM output. Additionally, we propose to restore the attenuated desired signal in the FBF output by adding some of the microphone signal components back, depending on how microphone signals are related to the FBF and BM outputs. The experimental results showed that the proposed TF-GSC outperformed conventional TF-GSC in terms of the perceptual evaluation of speech quality (PESQ) scores under various noise conditions and the direction of arrivals for the desired and interfering sources.

Palabras claves

dual-mic speech enhancement - transfer function-generalized sidelobe canceller - steering error - leakage suppression - signal recovery

Acceso

P�GINAS

pp. 0 - 0

N�MERO

Volumen: 11 Parte: 6 (2021)

MATERIAS

INGENIER�A Y CONSTRUCCI�N CIVIL
TECNOLOG�A

REVISTAS SIMILARES

Algorithms
Applied Sciences
Information

DOI

https://doi.org/10.3390/app11062816

Art�culos similares

One-Step Discrete Fourier Transform-Based Sinusoid Frequency Estimation under Full-Bandwidth Quasi-Harmonic Interference

Acceso

Jo�o Miguel Silva, Marco Ant�nio Oliveira, Andr� Ferraz Saraiva and An�bal J. S. Ferreira

The estimation of the frequency of sinusoids has been the object of intense research for more than 40 years. Its importance in classical fields such as telecommunications, instrumentation, and medicine has been extended to numerous specific signal proces... ver m�s

Revista: Acoustics

Effects of Conventional and Musician-Specific Hearing Protection Devices on Speech Intelligibility

Acceso

Giovanna Cardoso Pinto, Clayton Henrique Rocha, Carla Gentile Matas and Alessandra Giannella Samelli

(1) Background: To assess and compare speech intelligibility with conventional and universal musician-specific hearing protection devices (HPD); (2) Methods: The sample comprised 15 normal-hearing musicians of both sexes who had been professionals for mo... ver m�s

Revista: Acoustics

Energy-Efficient Audio Processing at the Edge for Biologging Applications

Acceso

Jonathan Miquel, Laurent Latorre and Simon Chamaill�-Jammes

Biologging refers to the use of animal-borne recording devices to study wildlife behavior. In the case of audio recording, such devices generate large amounts of data over several months, and thus require some level of processing automation for the raw d... ver m�s

Revista: Journal of Low Power Electronics and Applications

Voice Deepfake Detection Using the Self-Supervised Pre-Training Model HuBERT

Acceso

Lanting Li, Tianliang Lu, Xingbang Ma, Mengjiao Yuan and Da Wan

In recent years, voice deepfake technology has developed rapidly, but current detection methods have the problems of insufficient detection generalization and insufficient feature extraction for unknown attacks. This paper presents a forged speech detect... ver m�s

Revista: Applied Sciences

Analyzing Noise Robustness of Cochleogram and Mel Spectrogram Features in Deep Learning Based Speaker Recognition

Acceso

Wondimu Lambamo, Ramasamy Srinivasagan and Worku Jifara

The performance of speaker recognition systems is very well on the datasets without noise and mismatch. However, the performance gets degraded with the environmental noises, channel variation, physical and behavioral changes in speaker. The types of Spea... ver m�s

Revista: Applied Sciences

Revistas destacadas

Acceso directo a los n�meros publicados en la revista Infrastructures

Infrastructures

Acceso directo a los n�meros publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los n�meros publicados en la revista BiT

Acceso directo a los n�meros publicados en la revista Revista de la Construcci�n

Revista de la Construcci�n

Ver todas las revistas disponibles