REVISTA
Applied Sciences

TODAS

Redirigiendo al acceso original de articulo en 23 segundos...

Inicio / Applied Sciences / Vol: 9 Par: 10 (2019) / Art�culo

ART�CULO

TITULO

Enhanced Automatic Speech Recognition System Based on Enhancing Power-Normalized Cepstral Coefficients

Mohamed Tamazin

Ahmed Gouda and Mohamed Khedr

Resumen

Many new consumer applications are based on the use of automatic speech recognition (ASR) systems, such as voice command interfaces, speech-to-text applications, and data entry processes. Although ASR systems have remarkably improved in recent decades, the speech recognition system performance still significantly degrades in the presence of noisy environments. Developing a robust ASR system that can work in real-world noise and other acoustic distorting conditions is an attractive research topic. Many advanced algorithms have been developed in the literature to deal with this problem; most of these algorithms are based on modeling the behavior of the human auditory system with perceived noisy speech. In this research, the power-normalized cepstral coefficient (PNCC) system is modified to increase robustness against the different types of environmental noises, where a new technique based on gammatone channel filtering combined with channel bias minimization is used to suppress the noise effects. The TIDIGITS database is utilized to evaluate the performance of the proposed system in comparison to the state-of-the-art techniques in the presence of additive white Gaussian noise (AWGN) and seven different types of environmental noises. In this research, one word is recognized from a set containing 11 possibilities only. The experimental results showed that the proposed method provides significant improvements in the recognition accuracy at low signal to noise ratios (SNR). In the case of subway noise at SNR = 5 dB, the proposed method outperforms the mel-frequency cepstral coefficient (MFCC) and relative spectral (RASTA)?perceptual linear predictive (PLP) methods by 55% and 47%, respectively. Moreover, the recognition rate of the proposed method is higher than the gammatone frequency cepstral coefficient (GFCC) and PNCC methods in the case of car noise. It is enhanced by 40% in comparison to the GFCC method at SNR 0dB, while it is improved by 20% in comparison to the PNCC method at SNR -5dB.

Palabras claves

robust automatic speech recognition - ASR - feature extraction - MFCC - RASTA?PLP - GFCC - PNCC

Acceso

P�GINAS

pp. 0 - 0

N�MERO

Volumen: 9 Parte: 10 (2019)

MATERIAS

INGENIER�A Y CONSTRUCCI�N CIVIL
TECNOLOG�A

REVISTAS SIMILARES

Applied Sciences
Journal of Marine Science and Engineering
Aerospace

DOI

https://doi.org/10.3390/app9102166

Art�culos similares

Ship Trajectory Clustering Based on Trajectory Resampling and Enhanced BIRCH Algorithm

Acceso

Zhaojin Yan, Guanghao Yang, Rong He, Hui Yang, Hui Ci and Ran Wang

Automatic identification systems (AIS) provides massive ship trajectory data for maritime traffic management, route planning, and other research. In order to explore the valuable ship traffic characteristics contained implicitly in massive AIS data, a sh... ver m�s

Revista: Journal of Marine Science and Engineering

Multi-Hop Question Generation with Knowledge Graph-Enhanced Language Model

Acceso

Zhenping Li, Zhen Cao, Pengfei Li, Yong Zhong and Shaobo Li

The task of multi-hop question generation (QG) seeks to generate questions that require a complex reasoning process that spans multiple sentences and answers. Beyond the conventional challenges of what to ask and how to ask, multi-hop QG necessitates sop... ver m�s

Revista: Applied Sciences

Mining on Students? Execution Logs and Repairing Compilation Errors Based on Deep Learning

Acceso

Ruoyan Shi, Jianpeng Hu and Bo Lin

Automatic program repair techniques based on deep neural networks have attracted widespread attention from researchers due to the high degree of automation and generality. However, there is a scarcity of high-quality labeled datasets available for traini... ver m�s

Revista: Applied Sciences

HPC Platform for Railway Safety-Critical Functionalities Based on Artificial Intelligence

Acceso

Mikel Labayen, Laura Medina, Fernando Eizaguirre, Jos� Flich and Naiara Aginako

The automation of railroad operations is a rapidly growing industry. In 2023, a new European standard for the automated Grade of Automation (GoA) 2 over European Train Control System (ETCS) driving is anticipated. Meanwhile, railway stakeholders are alre... ver m�s

Revista: Applied Sciences

A Multiscale Local?Global Feature Fusion Method for SAR Image Classification with Bayesian Hyperparameter Optimization Algorithm

Acceso

Xiaoqin Lian, Xue Huang, Chao Gao, Guochun Ma, Yelan Wu, Yonggang Gong, Wenyang Guan and Jin Li

In recent years, the advancement of deep learning technology has led to excellent performance in synthetic aperture radar (SAR) automatic target recognition (ATR) technology. However, due to the interference of speckle noise, the task of classifying SAR ... ver m�s

Revista: Applied Sciences

Revistas destacadas

Acceso directo a los n�meros publicados en la revista Infrastructures

Infrastructures

Acceso directo a los n�meros publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los n�meros publicados en la revista BiT

Acceso directo a los n�meros publicados en la revista Revista de la Construcci�n

Revista de la Construcci�n

Ver todas las revistas disponibles