Inicio  /  Applied Sciences  /  Vol: 9 Par: 9 (2019)  /  Artículo
ARTÍCULO
TITULO

Regularized Urdu Speech Recognition with Semi-Supervised Deep Learning

Mohammad Ali Humayun    
Ibrahim A. Hameed    
Syed Muslim Shah    
Sohaib Hassan Khan    
Irfan Zafar    
Saad Bin Ahmed and Junaid Shuja    

Resumen

Automatic Speech Recognition, (ASR) has achieved the best results for English, with end-to-end neural network based supervised models. These supervised models need huge amounts of labeled speech data for good generalization, which can be quite a challenge to obtain for low-resource languages like Urdu. Most models proposed for Urdu ASR are based on Hidden Markov Models (HMMs). This paper proposes an end-to-end neural network model, for Urdu ASR, regularized with dropout, ensemble averaging and Maxout units. Dropout and ensembles are averaging techniques over multiple neural network models while Maxout are units in a neural network which adapt their activation functions. Due to limited labeled data, Semi Supervised Learning (SSL) techniques are also incorporated to improve model generalization. Speech features are transformed into a lower dimensional manifold using an unsupervised dimensionality-reduction technique called Locally Linear Embedding (LLE). Transformed data along with higher dimensional features is used to train neural networks. The proposed model also utilizes label propagation-based self-training of initially trained models and achieves a Word Error Rate (WER) of 4% less than that reported as the benchmark on the same Urdu corpus using HMM. The decrease in WER after incorporating SSL is more significant with an increased validation data size.

 Artículos similares

       
 
Dibo Dong, Shangwei Wang, Qiaoying Guo, Yiting Ding, Xing Li and Zicheng You    
Predicting wind speed over the ocean is difficult due to the unequal distribution of buoy stations and the occasional fluctuations in the wind field. This study proposes a dynamic graph embedding-based graph neural network?long short-term memory joint fr... ver más

 
María Gema Carrasco-García, María Inmaculada Rodríguez-García, Juan Jesús Ruíz-Aguilar, Lipika Deka, David Elizondo and Ignacio José Turias Domínguez    
Hyperspectral technology has been playing a leading role in monitoring oil spills in marine environments, which is an issue of international concern. In the case of monitoring oil spills in local areas, hyperspectral technology of small dimensions is the... ver más

 
Shaoyan Zuo, Dazhi Wang, Xiao Wang, Liujia Suo, Shuaiwu Liu, Yongqing Zhao and Dewang Liu    
In this study, a deep learning network for extracting spatial-temporal features is proposed to estimate significant wave height (???? H s ) and wave period (???? T s ) from X-band marine radar images. Since the shore-based radar image in this study is in... ver más

 
Sheng Zhang, Guangzhong Liu and Chen Cheng    
Over the past few decades, unmanned surface vehicles (USV) have drawn a lot of attention. But because of the wind, waves, currents, and other sporadic disturbances, it is challenging to understand and collect correct data about USV dynamics. In this pape... ver más

 
Min Xu, Wenjie Tian and Xiangpeng Zhang    
The three-degrees-of-freedom (3-DOF) parallel robot is commonly employed as a shipborne stabilized platform for real-time compensation of ship disturbances. Pose accuracy is one of its most critical performance indicators. Currently, neural networks have... ver más