ARTÍCULO
TITULO

Using regular expressions to process text data

S.V. Kozlov    
A. V. Svetlakov    

Resumen

This article describes the use of regular expressions in solving syntactic and lexical analysis problems. The concept of a regular expression is given, its essence is briefly described. The authors set three main tasks of using regular expressions in software applications. The first of these is to check text messages for compliance with a given input field template. The solution of this problem allows you to verify the data and systematize them in the information system in a uniform form. The second task is to analyze blocks of text when entering data into them. The solution to this problem allows you to identify text fragments entered with errors, examine them and make an appropriate replacement according to the specified rules. The third task determines the direction of using regular expressions when writing translators in modern instrumental environments. Its solution opens up the possibility of developing interpreters and frequency dictionaries for lexical and syntactic text analysis. For each of the described tasks, corresponding examples of computer programs are given. The authors demonstrate the implementation of regular expressions in program code written in the C# programming language, their own developed text data analysis applications. The relevance of the article is related to the study of methods for syntactic and lexical analysis of information flows in text pattern recognition systems, which are effectively used as artificial intelligence tools.

 Artículos similares

       
 
Hongguo Ren, Ziming Zheng, Jing Zhang, Qingqin Wang and Yujun Wang    
Urban landscape parks play a crucial role in providing recreational opportunities for citizens. Different types of landscapes offer varying levels of comfort experiences. However, the assessment of landscape comfort primarily relies on subjective evaluat... ver más
Revista: Applied Sciences

 
Mu-Lin Huang, Xue-Mei He, Ming-Yue Du, Peng-Fei Jiang and Xue-Feng Wang    
High-temperature and high-pressure (HTHP)-treated diamonds have attracted attention all over the world due to their vivid colors. In order to explore a new method for the rapid and non-destructive identification of HTHP-treated diamonds, in this paper, f... ver más
Revista: Applied Sciences

 
Rossana Caroni, Monica Pinardi, Gary Free, Daniela Stroppiana, Lorenzo Parigi, Giulio Tellina, Mariano Bresciani, Clément Albergel and Claudia Giardino    
A study was carried out to investigate the effects of wildfires on lake water quality using a source dataset of 2024 lakes worldwide, covering different lake types and ecological settings. Satellite-derived datasets (Lakes_cci and Fire_cci) were used and... ver más
Revista: Applied Sciences

 
Rafal Michalski and Szymon Zaleski    
Although there have been some studies on the success factors for IT software projects, there is still a lack of coherent research on the success factors for IT service projects. Therefore, this study aimed to identify and understand the factors and their... ver más
Revista: Information

 
Moatsum Alawida, Bayan Abu Shawar, Oludare Isaac Abiodun, Abid Mehmood, Abiodun Esther Omolara and Ahmad K. Al Hwaitat    
The Chat Generative Pre-training Transformer (GPT), also known as ChatGPT, is a powerful generative AI model that can simulate human-like dialogues across a variety of domains. However, this popularity has attracted the attention of malicious actors who ... ver más
Revista: Information