Redirigiendo al acceso original de articulo en 15 segundos...
Inicio  /  Applied Sciences  /  Vol: 9 Par: 9 (2019)  /  Artículo
ARTÍCULO
TITULO

Regularized Urdu Speech Recognition with Semi-Supervised Deep Learning

Mohammad Ali Humayun    
Ibrahim A. Hameed    
Syed Muslim Shah    
Sohaib Hassan Khan    
Irfan Zafar    
Saad Bin Ahmed and Junaid Shuja    

Resumen

Automatic Speech Recognition, (ASR) has achieved the best results for English, with end-to-end neural network based supervised models. These supervised models need huge amounts of labeled speech data for good generalization, which can be quite a challenge to obtain for low-resource languages like Urdu. Most models proposed for Urdu ASR are based on Hidden Markov Models (HMMs). This paper proposes an end-to-end neural network model, for Urdu ASR, regularized with dropout, ensemble averaging and Maxout units. Dropout and ensembles are averaging techniques over multiple neural network models while Maxout are units in a neural network which adapt their activation functions. Due to limited labeled data, Semi Supervised Learning (SSL) techniques are also incorporated to improve model generalization. Speech features are transformed into a lower dimensional manifold using an unsupervised dimensionality-reduction technique called Locally Linear Embedding (LLE). Transformed data along with higher dimensional features is used to train neural networks. The proposed model also utilizes label propagation-based self-training of initially trained models and achieves a Word Error Rate (WER) of 4% less than that reported as the benchmark on the same Urdu corpus using HMM. The decrease in WER after incorporating SSL is more significant with an increased validation data size.

 Artículos similares

       
 
Shurong Peng, Lijuan Guo, Haoyu Huang, Xiaoxu Liu and Jiayi Peng    
The integration of large-scale wind power into the power grid threatens the stable operation of the power system. Traditional wind power prediction is based on time series without considering the variability between wind turbines in different locations. ... ver más
Revista: Applied Sciences

 
Tatyana Aksenovich and Vasiliy Selivanov    
During geomagnetic storms, which are a result of solar wind?s interaction with the Earth?s magnetosphere, geomagnetically induced currents (GICs) begin to flow in the long, high-voltage electrical networks on the Earth?s surface. It causes a number of ne... ver más
Revista: Applied Sciences

 
Wendimu Fanta Gemechu, Wojciech Sitek and Gilmar Ferreira Batalha    
Revista: Applied Sciences

 
Qiyan Li, Zhi Weng, Zhiqiang Zheng and Lixin Wang    
The decrease in lake area has garnered significant attention within the global ecological community, prompting extensive research in remote sensing and computer vision to accurately segment lake areas from satellite images. However, existing image segmen... ver más
Revista: Applied Sciences

 
Shuting Xu and Jinming Xu    
The construction of deep foundation pits in subway stations can affect the settlement of existing buildings adjacent to the pits to varying degrees. In this paper, the Long Short-Term Memory neural network prediction model of building settlement caused b... ver más
Revista: Applied Sciences