Inicio  /  Information  /  Vol: 14 Par: 7 (2023)  /  Artículo
ARTÍCULO
TITULO

Scene Text Recognition Based on Improved CRNN

Wenhua Yu    
Mayire Ibrayim and Askar Hamdulla    

Resumen

Text recognition is an important research topic in computer vision. Scene text, which refers to the text in real scenes, sometimes needs to meet the requirement of attracting attention, and there is the situation such as deformation. At the same time, the image acquisition process is affected by factors such as occlusion, noise, and obstruction, making scene text recognition tasks more challenging. In this paper, we improve the CRNN model for text recognition, which has relatively low accuracy, poor performance in recognizing irregular text, and only considers obtaining text sequence information from a single aspect, resulting in incomplete information acquisition. Firstly, to address the problems of low text recognition accuracy and poor recognition of irregular text, we add label smoothing to ensure the model?s generalization ability. Then, we introduce the smoothing loss function from speech recognition into the field of text recognition, and add a language model to increase information acquisition channels, ultimately achieving the goal of improving text recognition accuracy. This method was experimentally verified on six public datasets and compared with other advanced methods. The experimental results show that this method performs well in most benchmark tests, and the improved model outperforms the original model in recognition performance.

 Artículos similares

       
 
Yuan Li, Mayire Ibrayim and Askar Hamdulla    
In the last years, methods for detecting text in real scenes have made significant progress with an increase in neural networks. However, due to the limitation of the receptive field of the central nervous system and the simple representation of text by ... ver más
Revista: Information

 
Minjun Jeon and Young-Seob Jeong    
Scene text detection is the task of detecting word boxes in given images. The accuracy of text detection has been greatly elevated using deep learning models, especially convolutional neural networks. Previous studies commonly aimed at developing more ac... ver más
Revista: Applied Sciences

 
Shiwei Chen, Dayue Yao, Huiliang Cao and Chong Shen    
Action and identification problems are the challenges that visually impaired people often encounter in their lives. The high price of existing commercial intelligent auxiliary equipment has placed enormous economic pressure on most visually impaired peop... ver más
Revista: Applied Sciences

 
Kobie Van Krieken    
News stories aim to create an immersive reading experience by virtually transporting the audience to the described scenes. In print journalism, this experience is facilitated by text-linguistic narrative techniques, such as detailed scene reconstructions... ver más
Revista: Information