Redirigiendo al acceso original de articulo en 21 segundos...
Inicio  /  Information  /  Vol: 14 Par: 4 (2023)  /  Artículo
ARTÍCULO
TITULO

A Dual Stream Generative Adversarial Network with Phase Awareness for Speech Enhancement

Xintao Liang    
Yuhang Li    
Xiaomin Li    
Yue Zhang and Youdong Ding    

Resumen

Implementing single-channel speech enhancement under unknown noise conditions is a challenging problem. Most existing time-frequency domain methods are based on the amplitude spectrogram, and these methods often ignore the phase mismatch between noisy speech and clean speech, which largely limits the performance of speech enhancement. To solve the phase mismatch problem and further improve enhancement performance, this paper proposes a dual-stream Generative Adversarial Network (GAN) with phase awareness, named DPGAN. Our generator uses a dual-stream structure to predict amplitude and phase separately and adds an information communication module between the two streams to fully apply the phase information. To make the prediction more efficient, we apply Transformer to build the generator, which can learn the sound?s structural properties more easily. Finally, we designed a perceptually guided discriminator that quantitatively evaluates the quality of speech, optimising the generator for specific evaluation metrics. We conducted experiments on the most widely used Voicebank-DEMAND dataset and DPGAN achieved state-of-the-art on most metrics.

Palabras claves

 Artículos similares

       
 
Mélissa Férand, Thomas Livebardon, Stéphane Moreau and Marlène Sanjosé    
A hybrid methodology combining a detailed Large Eddy Simulation of a combustion chamber sector, an analytical propagation model of the extracted acoustic and entropy waves at the combustor exit through the turbine stages, and a far-field acoustic propaga... ver más
Revista: Acoustics

 
Daniel Guariglia, Alejandro Rubio Carpio and Christophe Schram    
Shock-cell noise occurs in aero-engines when the nozzle exhaust is supersonic and shock-cells are present in the jet. In commercial turbofan engines, at cruise, the secondary flow is often supersonic underexpanded, with the formation of annular shock-cel... ver más
Revista: Aerospace

 
Wei-Che Huang, Chih-Chieh Young and Wen-Cheng Liu    
An automated discharge imaging system (ADIS), which is a non-intrusive and safe approach, was developed for measuring river flows during flash flood events. ADIS consists of dual cameras to capture complete surface images in the near and far fields. Surf... ver más
Revista: Water