Applied Sciences / Vol. 9, Issue 9 (2019)
ARTICLE
TITLE

Regularized Urdu Speech Recognition with Semi-Supervised Deep Learning

Mohammad Ali Humayun    
Ibrahim A. Hameed    
Syed Muslim Shah    
Sohaib Hassan Khan    
Irfan Zafar    
Saad Bin Ahmed and Junaid Shuja    

Abstract

Automatic Speech Recognition (ASR) has achieved its best results for English with end-to-end, neural-network-based supervised models. These supervised models need large amounts of labeled speech data to generalize well, which is difficult to obtain for low-resource languages such as Urdu. Most models proposed for Urdu ASR are based on Hidden Markov Models (HMMs). This paper proposes an end-to-end neural network model for Urdu ASR, regularized with dropout, ensemble averaging, and Maxout units. Dropout and ensembles are techniques that average over multiple neural network models, while Maxout units are neural network units that adapt their activation functions. Because labeled data is limited, Semi-Supervised Learning (SSL) techniques are also incorporated to improve model generalization. Speech features are transformed onto a lower-dimensional manifold using an unsupervised dimensionality-reduction technique called Locally Linear Embedding (LLE). The transformed data, along with the higher-dimensional features, is used to train the neural networks. The proposed model also uses label-propagation-based self-training of the initially trained models and achieves a Word Error Rate (WER) 4% lower than the benchmark reported on the same Urdu corpus using HMMs. The decrease in WER after incorporating SSL is more significant with a larger validation data size.
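
To make two of the regularization ideas named in the abstract concrete, the following is a minimal pure-Python sketch, not the authors' implementation. It assumes the standard formulation of a Maxout unit (the maximum over several affine pieces of the input) and the simplest form of ensemble averaging (the arithmetic mean of the class-probability vectors from several models). The function names `maxout_unit` and `ensemble_average`, and all numeric values, are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence


def maxout_unit(x: Sequence[float],
                weights: Sequence[Sequence[float]],
                biases: Sequence[float]) -> float:
    """Maxout activation: the maximum over k affine pieces w_j . x + b_j."""
    pieces = [
        sum(w_i * x_i for w_i, x_i in zip(w, x)) + b
        for w, b in zip(weights, biases)
    ]
    return max(pieces)


def ensemble_average(predictions: Sequence[Sequence[float]]) -> List[float]:
    """Average the class-probability vectors produced by several models."""
    n_models = len(predictions)
    return [sum(column) / n_models for column in zip(*predictions)]


# Hypothetical two-piece unit: with pieces x and -x it behaves like |x|.
abs_like = maxout_unit([-3.0], weights=[[1.0], [-1.0]], biases=[0.0, 0.0])

# Hypothetical posteriors over three output symbols from two models.
averaged = ensemble_average([[0.5, 0.25, 0.25], [0.25, 0.5, 0.25]])
```

Because the weights of each piece are learned, the same unit can take the shape of an absolute value, a rectifier (pieces x and 0), or another convex piecewise-linear function, which is the sense in which Maxout units adapt their activation functions.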
