ARTICLE
TITLE

Development of the algorithm of keyword search in the Kazakh language text corpus

Akerke Akanova    
Nazira Ospanova    
Yevgeniya Kukharenko    
Gulmira Abildinova    

Abstract

Semantic text analysis occupies a special place in computational linguistics, and researchers in the field are increasingly interested in algorithms that improve the quality of text corpus processing and the probabilistic determination of text content. A review of the methods, approaches, and algorithms for semantic text analysis used in international and Kazakhstani research led to the development of a keyword search algorithm for Kazakh text. The first step of the algorithm is to compile a reference dictionary of keywords for the Kazakh language text corpus. To do this, the Porter stemming algorithm was applied to the corpus; the stemmer extracts unique word stems, which form a reference dictionary that is subsequently indexed. The second step is to collect training data from the text corpus. To calculate the degree of semantic proximity between words, each word is assigned a vector of the corresponding word forms in the reference dictionary, which yields pairs of a keyword and a vector. The last step is training a neural network. Training uses error backpropagation, which enables semantic analysis of the text corpus and yields a probabilistic estimate of the number of keywords that is close to the expected number. This process automates the processing of text material by creating digital learning models of keywords. The algorithm is used to develop a neurocomputer system that will automatically check the written work of online learners. The distinguishing feature of the keyword search algorithm is its use of neural network learning for texts in the Kazakh language.
In Kazakhstan, researchers in computational linguistics have conducted a number of studies based on morphological analysis, lemmatization, and other approaches, and have implemented linguistic tools (mainly translation dictionaries). The application of neural network learning to parsing the Kazakh language remains an open issue in Kazakhstani science. The developed algorithm addresses one of the problems of effective semantic analysis of Kazakh text.
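The first two steps (stemming, building an indexed reference dictionary, and pairing words with vectors) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the suffix list is a small illustrative subset of Kazakh plural and genitive endings rather than a full Porter-style rule set, and `MIN_STEM_LEN`, `stem`, `build_reference_dictionary`, and `to_vector` are hypothetical names. The count-vector encoding is likewise an assumption about one way to relate word forms to dictionary entries.

```python
# Illustrative subset of Kazakh suffixes (plural and genitive); an assumption,
# not the rule set used in the study.
SUFFIXES = ("лар", "лер", "дар", "дер", "тар", "тер",
            "ның", "нің", "дың", "дің", "тың", "тің")
MIN_STEM_LEN = 3  # hypothetical guard against over-stemming short words


def stem(word: str) -> str:
    """Strip known suffixes repeatedly while the remainder stays long enough."""
    word = word.lower()
    stripped = True
    while stripped:
        stripped = False
        for suffix in SUFFIXES:
            if word.endswith(suffix) and len(word) - len(suffix) >= MIN_STEM_LEN:
                word = word[: -len(suffix)]
                stripped = True
                break
    return word


def build_reference_dictionary(tokens):
    """Map each unique stem to an integer index, in order of first appearance."""
    index = {}
    for token in tokens:
        index.setdefault(stem(token), len(index))
    return index


def to_vector(tokens, index):
    """Count how often each dictionary stem occurs among the tokens."""
    vector = [0] * len(index)
    for token in tokens:
        position = index.get(stem(token))
        if position is not None:
            vector[position] += 1
    return vector


corpus = ["кітап", "кітаптар", "кітаптың", "мектеп", "мектептер"]
reference = build_reference_dictionary(corpus)
print(reference)
print(to_vector(["мектептердің", "кітап"], reference))
```

In this toy corpus the five word forms collapse to two stems, so the dictionary stays small and every inflected form of a keyword maps to the same index.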
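The final step named in the abstract, neural network training with error backpropagation, can be sketched as follows. This is a generic one-hidden-layer network in plain Python, offered only as an illustration under stated assumptions: the architecture, the cross-entropy loss, the learning rate, and the toy training pairs are all hypothetical and are not taken from the study.

```python
import math
import random

random.seed(0)  # fixed seed so the sketch is reproducible


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyNet:
    """One hidden layer, sigmoid activations, trained by backpropagation."""

    def __init__(self, n_in, n_hidden):
        self.w1 = [[random.uniform(-0.5, 0.5) for _ in range(n_in)]
                   for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [random.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.b2 = 0.0

    def forward(self, x):
        hidden = [sigmoid(sum(w * xi for w, xi in zip(row, x)) + b)
                  for row, b in zip(self.w1, self.b1)]
        output = sigmoid(sum(w * h for w, h in zip(self.w2, hidden)) + self.b2)
        return hidden, output

    def train_step(self, x, target, lr):
        hidden, output = self.forward(x)
        # Output error for a sigmoid unit with cross-entropy loss.
        delta_out = output - target
        # Propagate the error back to the hidden layer (uses pre-update w2).
        delta_hidden = [delta_out * w * h * (1.0 - h)
                        for w, h in zip(self.w2, hidden)]
        for j, h in enumerate(hidden):
            self.w2[j] -= lr * delta_out * h
        self.b2 -= lr * delta_out
        for j, row in enumerate(self.w1):
            for i, xi in enumerate(x):
                row[i] -= lr * delta_hidden[j] * xi
            self.b1[j] -= lr * delta_hidden[j]


def mean_loss(net, data):
    """Average cross-entropy over the data set."""
    eps = 1e-12
    total = 0.0
    for x, t in data:
        _, y = net.forward(x)
        total -= t * math.log(y + eps) + (1 - t) * math.log(1 - y + eps)
    return total / len(data)


# Hypothetical training pairs: (stem-count vector, 1.0 if the text is on-topic).
data = [([2, 0, 1], 1.0), ([1, 1, 0], 1.0), ([0, 2, 0], 0.0), ([0, 0, 0], 0.0)]
net = TinyNet(n_in=3, n_hidden=4)
loss_before = mean_loss(net, data)
for _ in range(2000):
    for x, t in data:
        net.train_step(x, t, lr=0.5)
loss_after = mean_loss(net, data)
print(f"loss before: {loss_before:.3f}, after: {loss_after:.3f}")
```

The network output is a value between 0 and 1, which is one way to read the "probabilistic" result the abstract mentions; how the study maps outputs to keyword counts is not specified here.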
