Inicio  /  Applied Sciences  /  Vol: 9 Par: 9 (2019)  /  Artículo
ARTÍCULO
TITULO

Regularized Urdu Speech Recognition with Semi-Supervised Deep Learning

Mohammad Ali Humayun    
Ibrahim A. Hameed    
Syed Muslim Shah    
Sohaib Hassan Khan    
Irfan Zafar    
Saad Bin Ahmed and Junaid Shuja    

Resumen

Automatic Speech Recognition, (ASR) has achieved the best results for English, with end-to-end neural network based supervised models. These supervised models need huge amounts of labeled speech data for good generalization, which can be quite a challenge to obtain for low-resource languages like Urdu. Most models proposed for Urdu ASR are based on Hidden Markov Models (HMMs). This paper proposes an end-to-end neural network model, for Urdu ASR, regularized with dropout, ensemble averaging and Maxout units. Dropout and ensembles are averaging techniques over multiple neural network models while Maxout are units in a neural network which adapt their activation functions. Due to limited labeled data, Semi Supervised Learning (SSL) techniques are also incorporated to improve model generalization. Speech features are transformed into a lower dimensional manifold using an unsupervised dimensionality-reduction technique called Locally Linear Embedding (LLE). Transformed data along with higher dimensional features is used to train neural networks. The proposed model also utilizes label propagation-based self-training of initially trained models and achieves a Word Error Rate (WER) of 4% less than that reported as the benchmark on the same Urdu corpus using HMM. The decrease in WER after incorporating SSL is more significant with an increased validation data size.

 Artículos similares

       
 
Yong Liu, Xiaohui Yan, Wenying Du, Tianqi Zhang, Xiaopeng Bai and Ruichuan Nan    
The current work proposes a novel super-resolution convolutional transposed network (SRCTN) deep learning architecture for downscaling daily climatic variables. The algorithm was established based on a super-resolution convolutional neural network with t... ver más
Revista: Water

 
Song Xue, Jingyan Chen, Sheng Li and Huaai Huang    
Early warning of safety risks downstream of small reservoirs is directly related to the safety of people?s lives and property and the economic and social development of the region. The lack of data and low collaboration in downstream safety management of... ver más
Revista: Water

 
Mark A. Denisenko, Alina S. Isaeva, Alexander S. Sinyukin and Andrey V. Kovalev    
The fast, convenient, and accurate determination of railroad cars? load mass is critical to ensure safety and allow asset counting in railway infrastructure. In this paper, we propose a method for modeling the mechanical deformations that occur in the ra... ver más
Revista: Infrastructures

 
Fahim Sufi    
In the face of escalating cyber threats that have contributed significantly to global economic losses, this study presents a comprehensive dataset capturing the multifaceted nature of cyber-attacks across 225 countries over a 14-month period from October... ver más
Revista: Information

 
Zeyu Xu, Wenbin Yu, Chengjun Zhang and Yadang Chen    
In the era of noisy intermediate-scale quantum (NISQ) computing, the synergistic collaboration between quantum and classical computing models has emerged as a promising solution for tackling complex computational challenges. Long short-term memory (LSTM)... ver más
Revista: Information