Inicio  /  Applied Sciences  /  Vol: 12 Par: 23 (2022)  /  Artículo
ARTÍCULO
TITULO

Applying a Character-Level Model to a Short Arabic Dialect Sentence: A Saudi Dialect as a Case Study

Tahani Alqurashi    

Resumen

Arabic dialect identification (ADI) has recently drawn considerable interest among researchers in language recognition and natural language processing fields. This study investigated the use of a character-level model that is effectively unrestricted in its vocabulary, to identify fine-grained Arabic language dialects in the form of short written text. The Saudi dialects, particularly the four main Saudi dialects across the country, were considered in this study. The proposed ADI approach consists of five main phases, namely dialect data collection, data preprocessing and labelling, character-based feature extraction, deep learning character-based model/classical machine learning character-based models, and model evaluation performance. Several classical machine learning methods, including logistic regression, stochastic gradient descent, variations of the naive Bayes models, and support vector classification, were applied to the dataset. For the deep learning, the character convolutional neural network (CNN) model was adapted with a bidirectional long short-term memory approach. The collected data were tested under various classification tasks, including two-, three- and four-way ADI tasks. The results revealed that classical machine learning algorithms outperformed the CNN approach. Moreover, the use of the term frequency?inverse document frequency, combined with a character n-grams model ranging from unigrams to four-grams achieved the best performance among the tested parameters.

 Artículos similares

       
 
Jiahao Fan and Weijun Pan    
In recent years, automatic speech recognition (ASR) technology has improved significantly. However, the training process for an ASR model is complex, involving large amounts of data and a large number of algorithms. The task of training a new model for a... ver más
Revista: Aerospace

 
Zhifu Lin, Dasheng Xiao and Hong Xiao    
Flow through complex thermodynamic machinery is intricate, incorporating turbulence, compressibility effects, combustion, and solid?fluid interactions, posing a challenge to classical physics. For example, it is not currently possible to simulate a three... ver más
Revista: Aerospace

 
Javensius Sembiring, Rianto Adhy Sasongko, Eduardo I. Bastian, Bayu Aji Raditya and Rayhan Ekananto Limansubroto    
This paper investigates the development of a deep learning-based flight control model for a tilt-rotor unmanned aerial vehicle, focusing on altitude, speed, and roll hold systems. Training data is gathered from the X-Plane flight simulator, employing a p... ver más
Revista: Aerospace

 
Bocheng Zhao, Mingying Huo, Ze Yu, Naiming Qi and Jianfeng Wang    
In this study, we propose an aerial rendezvous method to facilitate the recovery of unmanned aerial vehicles (UAVs) using carrier aircrafts, which is an important capability for the future use of UAVs. The main contribution of this study is the developme... ver más
Revista: Aerospace

 
Minxing Dong, Jichao Yang, Yushan Fu, Tengfei Fu, Qing Zhao, Xuelei Zhang, Qinzeng Xu and Wenquan Zhang    
The soft coral order Alcyonacea is a common coral found in the deep sea and plays a crucial role in the deep-sea ecosystem. This study aims to predict the distribution of Alcyonacea in the western Pacific Ocean using four machine learning-based species d... ver más