Redirigiendo al acceso original de articulo en 17 segundos...
Inicio  /  Applied Sciences  /  Vol: 9 Par: 9 (2019)  /  Artículo
ARTÍCULO
TITULO

Regularized Urdu Speech Recognition with Semi-Supervised Deep Learning

Mohammad Ali Humayun    
Ibrahim A. Hameed    
Syed Muslim Shah    
Sohaib Hassan Khan    
Irfan Zafar    
Saad Bin Ahmed and Junaid Shuja    

Resumen

Automatic Speech Recognition, (ASR) has achieved the best results for English, with end-to-end neural network based supervised models. These supervised models need huge amounts of labeled speech data for good generalization, which can be quite a challenge to obtain for low-resource languages like Urdu. Most models proposed for Urdu ASR are based on Hidden Markov Models (HMMs). This paper proposes an end-to-end neural network model, for Urdu ASR, regularized with dropout, ensemble averaging and Maxout units. Dropout and ensembles are averaging techniques over multiple neural network models while Maxout are units in a neural network which adapt their activation functions. Due to limited labeled data, Semi Supervised Learning (SSL) techniques are also incorporated to improve model generalization. Speech features are transformed into a lower dimensional manifold using an unsupervised dimensionality-reduction technique called Locally Linear Embedding (LLE). Transformed data along with higher dimensional features is used to train neural networks. The proposed model also utilizes label propagation-based self-training of initially trained models and achieves a Word Error Rate (WER) of 4% less than that reported as the benchmark on the same Urdu corpus using HMM. The decrease in WER after incorporating SSL is more significant with an increased validation data size.

 Artículos similares

       
 
Pengyun Chen, Zhiru Li, Guangqing Liu, Ziyi Wang, Jiayu Chen, Shangyao Shi, Jian Shen and Lizhou Li    
The positioning results of terrain matching in flat terrain areas will significantly deteriorate due to the influence of terrain nonlinearity and multibeam measurement noise. To tackle this problem, this study presents the Pulse-Coupled Neural Network (P... ver más

 
Pengfei Ning, Dianjun Zhang, Xuefeng Zhang, Jianhui Zhang, Yulong Liu, Xiaoyi Jiang and Yansheng Zhang    
The Array for Real-time Geostrophic Oceanography (Argo) program provides valuable data for maritime research and rescue operations. This paper is based on Argo historical and satellite observations, and inverted sea surface and submarine drift trajectori... ver más

 
Min Xu, Wenjie Tian and Xiangpeng Zhang    
The three-degrees-of-freedom (3-DOF) parallel robot is commonly employed as a shipborne stabilized platform for real-time compensation of ship disturbances. Pose accuracy is one of its most critical performance indicators. Currently, neural networks have... ver más

 
Shun Wang, Jiayan Wang, Zhikang Xu, Ji Wang, Rui Li and Jinliang Dai    
The application of titanium alloy in shipbuilding can reduce ship weight and carbon emissions. To solve the problem of titanium alloy forming, the deformation prediction of titanium alloy line heating based on a backpropagation (BP) neural network and sp... ver más

 
Yifan Shang, Wanneng Yu, Guangmiao Zeng, Huihui Li and Yuegao Wu    
Image recognition is vital for intelligent ships? autonomous navigation. However, traditional methods often fail to accurately identify maritime objects? spatial positions, especially under electromagnetic silence. We introduce the StereoYOLO method, an ... ver más