Inicio  /  Applied Sciences  /  Vol: 11 Par: 6 (2021)  /  Artículo
ARTÍCULO
TITULO

Dual-Mic Speech Enhancement Based on TF-GSC with Leakage Suppression and Signal Recovery

Hansol Kim and Jong Won Shin    

Resumen

The transfer function-generalized sidelobe canceller (TF-GSC) is one of the most popular structures for the adaptive beamformer used in multi-channel speech enhancement. Although the TF-GSC has shown decent performance, a certain amount of steering error is inevitable, which causes leakage of speech components through the blocking matrix (BM) and distortion in the fixed beamformer (FBF) output. In this paper, we propose to suppress the leaked signal in the output of the BM and restore the desired signal in the FBF output of the TF-GSC. To reduce the risk of attenuating speech in the adaptive noise canceller (ANC), the speech component in the output of the BM is suppressed by applying a gain function similar to the square-root Wiener filter, assuming that a certain portion of the desired speech should be leaked into the BM output. Additionally, we propose to restore the attenuated desired signal in the FBF output by adding some of the microphone signal components back, depending on how microphone signals are related to the FBF and BM outputs. The experimental results showed that the proposed TF-GSC outperformed conventional TF-GSC in terms of the perceptual evaluation of speech quality (PESQ) scores under various noise conditions and the direction of arrivals for the desired and interfering sources.

 Artículos similares

       
 
João Miguel Silva, Marco António Oliveira, André Ferraz Saraiva and Aníbal J. S. Ferreira    
The estimation of the frequency of sinusoids has been the object of intense research for more than 40 years. Its importance in classical fields such as telecommunications, instrumentation, and medicine has been extended to numerous specific signal proces... ver más
Revista: Acoustics

 
Giovanna Cardoso Pinto, Clayton Henrique Rocha, Carla Gentile Matas and Alessandra Giannella Samelli    
(1) Background: To assess and compare speech intelligibility with conventional and universal musician-specific hearing protection devices (HPD); (2) Methods: The sample comprised 15 normal-hearing musicians of both sexes who had been professionals for mo... ver más
Revista: Acoustics

 
Jonathan Miquel, Laurent Latorre and Simon Chamaillé-Jammes    
Biologging refers to the use of animal-borne recording devices to study wildlife behavior. In the case of audio recording, such devices generate large amounts of data over several months, and thus require some level of processing automation for the raw d... ver más

 
Lanting Li, Tianliang Lu, Xingbang Ma, Mengjiao Yuan and Da Wan    
In recent years, voice deepfake technology has developed rapidly, but current detection methods have the problems of insufficient detection generalization and insufficient feature extraction for unknown attacks. This paper presents a forged speech detect... ver más
Revista: Applied Sciences

 
Wondimu Lambamo, Ramasamy Srinivasagan and Worku Jifara    
The performance of speaker recognition systems is very well on the datasets without noise and mismatch. However, the performance gets degraded with the environmental noises, channel variation, physical and behavioral changes in speaker. The types of Spea... ver más
Revista: Applied Sciences