Applied Sciences, Vol. 9, Issue 16 (2019)
Title

Speech Enhancement Using Generative Adversarial Network by Distilling Knowledge from Statistical Method

Jianfeng Wu    
Yongzhu Hua    
Shengying Yang    
Hongshuai Qin and Huibin Qin    

Abstract

This paper presents a new deep neural network (DNN)-based speech enhancement algorithm that integrates knowledge distilled from a traditional statistical method. Other DNN-based methods usually either train many different models on the same data and average their predictions, or use a large number of noise types to enlarge the set of simulated noisy speech. The proposed method, by contrast, neither trains an ensemble of models nor requires a large amount of simulated noisy speech. It first trains a discriminator network and a generator network simultaneously using adversarial learning. Then, both networks are re-trained by distilling knowledge from the statistical method, an approach inspired by knowledge distillation in neural networks. Finally, the generator network is fine-tuned using real noisy speech. Experiments on the CHiME4 data sets demonstrate that the proposed method achieves more robust performance than the compared DNN-based method in terms of perceptual speech quality.
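
The abstract does not give the loss functions, so the following is only an illustrative sketch of the general idea of distilling from a statistical estimator, not the authors' formulation. It assumes a Wiener-style gain as the statistical "teacher", magnitude spectra as inputs, and a mean-squared-error loss blended by a weight `alpha`; the function names `wiener_teacher` and `distillation_loss`, the power-subtraction SNR estimate, and all parameter values are hypothetical.

```python
import numpy as np

def wiener_teacher(noisy_mag, noise_psd, floor=1e-3):
    """Hypothetical statistical teacher: Wiener-style gain applied to a noisy magnitude spectrum.

    Assumes the noise power spectral density is already known or estimated elsewhere.
    """
    noisy_psd = noisy_mag ** 2
    # Crude SNR estimate by power subtraction (an assumption, not the paper's estimator).
    snr = np.maximum(noisy_psd / np.maximum(noise_psd, 1e-12) - 1.0, 0.0)
    # Wiener gain snr / (1 + snr), floored to avoid zeroing bins completely.
    gain = np.maximum(snr / (1.0 + snr), floor)
    return gain * noisy_mag

def distillation_loss(student_mag, clean_mag, teacher_mag, alpha=0.5):
    """Blend a supervised term (vs. clean speech) with a distillation term (vs. the teacher)."""
    hard = np.mean((student_mag - clean_mag) ** 2)    # match the clean target
    soft = np.mean((student_mag - teacher_mag) ** 2)  # match the statistical estimate
    return alpha * hard + (1.0 - alpha) * soft

if __name__ == "__main__":
    noisy = np.array([2.0, 1.0])
    noise_psd = np.array([1.0, 1.0])
    teacher = wiener_teacher(noisy, noise_psd)
    student = np.array([1.4, 0.1])   # stand-in for a generator output
    clean = np.array([1.5, 0.0])
    print(distillation_loss(student, clean, teacher, alpha=0.5))
```

In a real system the student output would come from the generator network and the loss would be minimized by gradient descent; here NumPy arrays stand in for both so that the blending of the two terms is easy to inspect.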

Similar articles

       
 
Eran Shachar, Israel Cohen and Baruch Berdugo    
This paper presents an echo suppression system that combines a linear acoustic echo canceller (AEC) with a deep complex convolutional recurrent network (DCCRN) for residual echo suppression. The filter taps of the AEC are adjusted in subbands by using th...
Journal: Algorithms

 
Xintao Liang, Yuhang Li, Xiaomin Li, Yue Zhang and Youdong Ding    
Implementing single-channel speech enhancement under unknown noise conditions is a challenging problem. Most existing time-frequency domain methods are based on the amplitude spectrogram, and these methods often ignore the phase mismatch between noisy sp...
Journal: Information

 
Vasundhara Shukla and Preety D. Swami    
This paper introduces a novel speech enhancement approach called dominant columns group orthogonalization of the sensing matrix (DCGOSM) in compressive sensing (CS). DCGOSM optimizes the sensing matrix using particle swarm optimization (PSO), ensuring se...
Journal: Applied Sciences

 
Shuiyan Li, Rongzhi Qi and Shengnan Zhang    
Compared with English named entity recognition (NER), Chinese NER faces significant challenges due to the flexible, non-standard word formation and vague word boundaries, which cause a lot of boundary ambiguity and reduce the accuracy of entity identific...
Journal: Applied Sciences

 
Fengqi Tan, Changchun Bao and Jing Zhou    
Adaptive beamforming and deconvolution techniques have shown effectiveness for reducing noise and reverberation. The minimum variance distortionless response (MVDR) beamformer is the most widely used for adaptive beamforming, whereas multichannel linear ...
Journal: Applied Sciences