REVISTA
Applied Sciences

TODAS

Inicio / Applied Sciences / Vol: 13 Par: 3 (2023) / Art�culo

ART�CULO

TITULO

Speech Enhancement Based on Two-Stage Processing with Deep Neural Network for Laser Doppler Vibrometer

Chengkai Cai

Kenta Iwai and Takanobu Nishiura

Resumen

The development of distant-talk measurement systems has been attracting attention since they can be applied to many situations such as security and disaster relief. One such system that uses a device called a laser Doppler vibrometer (LDV) to acquire sound by measuring an object?s vibration caused by the sound source has been proposed. Different from traditional microphones, an LDV can pick up the target sound from a distance even in a noisy environment. However, the acquired sounds are greatly distorted due to the object?s shape and frequency response. Due to the particularity of the degradation of observed speech, conventional methods cannot be effectively applied to LDVs. We propose two speech enhancement methods that are based on two-stage processing with deep neural networks for LDVs. With the first proposed method, the amplitude spectrum of the observed speech is first restored. The phase difference between the observed and clean speech is then estimated using the restored amplitude spectrum. With the other proposed method, the low-frequency components of the observed speech are first restored. The high-frequency components are then estimated by the restored low-frequency components. The evaluation results indicate that they improved the observed speech in sound quality, deterioration degree, and intelligibility.

Palabras claves

distant-talking speech measurement - speech enhancement - deep neural network - laser Doppler vibrometer

Acceso

P�GINAS

pp. 0 - 0

N�MERO

Volumen: 13 Parte: 3 (2023)

MATERIAS

INGENIER�A Y CONSTRUCCI�N CIVIL
TECNOLOG�A

REVISTAS SIMILARES

Applied Sciences
Aerospace
Algorithms

DOI

https://doi.org/10.3390/app13031958

Art�culos similares

A Dual Stream Generative Adversarial Network with Phase Awareness for Speech Enhancement

Acceso

Xintao Liang, Yuhang Li, Xiaomin Li, Yue Zhang and Youdong Ding

Implementing single-channel speech enhancement under unknown noise conditions is a challenging problem. Most existing time-frequency domain methods are based on the amplitude spectrogram, and these methods often ignore the phase mismatch between noisy sp... ver m�s

Revista: Information

Orthogonalization of the Sensing Matrix Through Dominant Columns in Compressive Sensing for Speech Enhancement

Acceso

Vasundhara Shukla and Preety D. Swami

This paper introduces a novel speech enhancement approach called dominant columns group orthogonalization of the sensing matrix (DCGOSM) in compressive sensing (CS). DCGOSM optimizes the sensing matrix using particle swarm optimization (PSO), ensuring se... ver m�s

Revista: Applied Sciences

Chinese Named Entity Recognition Based on Boundary Enhancement with Multi-Class Information

Acceso

Shuiyan Li, Rongzhi Qi and Shengnan Zhang

Compared with English named entity recognition (NER), Chinese NER faces significant challenges due to the flexible, non-standard word formation and vague word boundaries, which cause a lot of boundary ambiguity and reduce the accuracy of entity identific... ver m�s

Revista: Applied Sciences

Speech Enhancement Framework with Noise Suppression Using Block Principal Component Analysis

Acceso

Abdullah Zaini Alsheibi, Kimon P. Valavanis, Asif Iqbal and Muhammad Naveed Aman

With the advancement in voice-communication-based human?machine interface technology in smart home devices, the ability to decompose the received speech signal into a signal of interest and an interference component has emerged as a key requirement for t... ver m�s

Revista: Acoustics

Effective Dereverberation with a Lower Complexity at Presence of the Noise

Acceso

Fengqi Tan, Changchun Bao and Jing Zhou

Adaptive beamforming and deconvolution techniques have shown effectiveness for reducing noise and reverberation. The minimum variance distortionless response (MVDR) beamformer is the most widely used for adaptive beamforming, whereas multichannel linear ... ver m�s

Revista: Applied Sciences

Revistas destacadas

Acceso directo a los n�meros publicados en la revista Infrastructures

Infrastructures

Acceso directo a los n�meros publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los n�meros publicados en la revista BiT

Acceso directo a los n�meros publicados en la revista Revista de la Construcci�n

Revista de la Construcci�n

Ver todas las revistas disponibles